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BACKGROUND 

The blood is a vital part of the human circulatory system for the human body. 
5 Numerous cell types make up the blood tissue including monocytes, leukocytes, lymphocytes 
and erythrocytes. Although many blood cell types have been described, there are likely many 
as yet undiscovered cell types in the human blood. Some of these undiscovered cells may exist 
transiently, such as those derived from tissues and organs that are constantly interacting with 
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the circulating blood in health and disease. Thus, the blood can provide an immediate picture 
of what is happening in the human body at any given time. 

The turnover of cells in the hematopoietic system is enormous. It was reported that 
over one trillion cells, including 200 billion erythrocytes and 70 billion neutrophilic 
5 leukocytes, turn over each day in the human body (Ogawa 1993). As a consequence of 

continuous interactions between the blood and the body, genetic changes that occur within the 
cells or tissues of the body will trigger specific changes in gene expression within blood. It is 
the goal of the present invention that these genetic alterations be harnessed for diagnostic and 
prognostic purposes, which may lead to the development of therapeutics for ameliorating 
10 disease. 

For example, isoformic myosin heavy chain genes are known to be generally expressed 
in cardiac muscle tissue. In the rodent, the pMyHC gene is only highly expressed in the fetus 
and in diseased states such as overt cardiac hypertrophy, heart failure and diabetes; the aMyHC 
gene is highly expressed shortly after birth and continues to be expressed in the adult heart. In 

15 the human, however, PMyHC is highly expressed in the ventricles from the fetal stage through 
adulthood. This highly expressed pMyHC, which harbours several mutations, has been 
demonstrated to be involved in familial hypertrophic cardiomyopathy (Geisterfer-Lowrance et 
al 1990). It was reported that mutations of PMyHC can be detected by PCR using blood 
lymphocyte DNA (Ferrie et ah, 1992). Most recently, it was also demonstrated that mutations 

20 of the myosin-binding protein C in familial hypertrophic cardiomyopathy can be detected in 
the DNA extracted from lymphocytes (Niimura et al, 1998). 

Similarly, APP and APC, which are known to be tissue specific and predominantly 
expressed in the brain and intestinal tract, are also detectable in the transcripts of blood. These 
cell- or tissue-specific transcripts are not detectable by Northern blot analysis. However, the 
25 low number of transcript copies can be detected by RT-PCR analysis. These findings strongly 
demonstrate that genes preferentially expressed in specific tissues can be detected by a highly 
sensitive RT-PCR assay. In recent years, evidence has been obtained to indicate that 
expression of cell or tissue-restricted genes can be detected in the certain peripheral nucleated 
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blood cells of patients with metastatic transitional cell carcinoma (Yuasa et al 1998) and 
patients with prostate cancer (Gala et al 1998). 

In the prior art, there is a need for large samples and/or costly and time-consuming 
separation of cell types within the blood (Kimoto (1998) and Chelly et al (1989; 1988)). The 
5 prior art, however, is deficient in non-invasive methods of screening for tissue-specific 
diseases. The present invention fulfills this long-standing need and desire in the art. 

SUMMARY OF THE INVENTION 

The present invention relates generally to the molecular biology of human diseases. 
10 More specifically, the present invention relates to a process using the genetic information 
contained in human peripheral whole blood for the diagnosis, prognosis and monitoring of 
genetic and infectious disease in the human body. 

This present invention discloses a process of using the genetic information contained in 
human peripheral whole blood in the diagnosis, prognosis and monitoring of genetic and 
15 infectious disease in the human body. The process described herein requires a simple blood 
sample and is, therefore, non-invasive compared to conventional practices used to detect tissue 
specific disease, such as biopsies. 

The invention is based on the discovery that gene expression in the blood is reflective 
of body state and, as such, the resultant disruption of homeostasis under conditions of disease 
20 can be detected through analysis of transcripts differentially expressed in the blood alone. 
Thus, the identification of several key transcripts or genetic markers in blood will provide 
information about the genetic state of the cells, tissues, organ systems of the human body in 
health and disease. 

The present invention demonstrates that a simple drop of blood may be used to 
25 determine the quantitative expression of various mRNAs that reflect the health/disease state of 
the subject through the use of RT-PCR analysis. This entire process takes about three hours or 
less. The single drop of blood may also be used for multiple RT-PCR analyses. It is believed 
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that the present finding can potentially revolutionize the way that diseases are detected, 
diagnosed and monitored because it provides a non-invasive, simple, highly sensitive and 
quick screening for tissue-specific transcripts. The transcripts detected in whole blood have 
potential as prognostic or diagnostic markers of disease, as they reflect disturbances in 
5 homeostasis in the human body. Delineation of the sequences and/or quantitation of the 
expression levels of these marker genes by RT-PCR will allow for an immediate and accurate 
diagnostic/prognostic test for disease or to assess the efficacy and monitor a particular 
therapeutic. 

One object of the present invention is to provide a non-invasive method for the 
10 diagnosis, prognosis and monitoring of genetic and infectious disease in humans and animals. 

In one embodiment of the present invention, there is provided a method for detecting 
expression of a gene in blood from a subject, comprising the steps of: a) quantifying RNA 
from a subject blood sample; and b) detecting expression of the gene in the quantified RNA, 
wherein the expression of the gene in quantified RNA indicates the expression of the gene in 
15 the subject blood. An example of the quantifying method is by mass spectrometry. 

In another embodiment of the present invention, there is provided a method for 
detecting expression of one or more genes in blood from a subject, comprising the steps of: a) 
obtaining a subject blood sample; b) extracting RNA from the blood sample; c) amplifying the 
RNA; d) generating expressed sequence tags (ESTs) from the amplified RNA product; and e) 
20 detecting expression of the genes in the ESTs, wherein the expression of the genes in the ESTs 
indicates the expression of the genes in the subject blood. Preferably, the subject is a fetus, an 
embryo, a child, an adult or a non-human animal. The genes are non-cancer-associated and 
tissue-specific genes. Still preferably, the amplification is performed by RT-PCR using 
random sequence primers or gene-specific primers. 

25 In still another embodiment of the present invention, there is provided a method for 

detecting expression of one or more genes in blood from a subject, comprising the steps of: a) 
obtaining a subject blood sample; b) extracting DNA fragments from the blood sample; c) 
amplifying the DNA fragments; and d) detecting expression of the genes in the amplified DNA 
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product, wherein the expression of the genes in the amplified DNA product indicates the 
expression of the genes in the subject blood. 

In yet another embodiment of the present invention, there is provided a method for 
monitoring a course of a therapeutic treatment in an individual, comprising the steps of: a) 

5 obtaining a blood sample from the individual; b) extracting RNA from the blood sample; c) 
amplifying the RNA; d) generating expressed sequence tags (ESTs) from the amplified RNA 
product; e) detecting expression of genes in the ESTs, wherein the expression of the genes is 
associated with the effect of the therapeutic treatment; and f) repeating steps a)-e), wherein the 
course of the therapeutic treatment is monitored by detecting the change of expression of the 

10 genes in the ESTs. Such a method may also be used for monitoring the onset of overt 

symptoms of a disease, wherein the expression of the genes is associated with the onset of the 
symptoms. Preferably, the amplification is performed by RT-PCR, and the change of the 
expression of the genes in the ESTs is monitored by sequencing the ESTs and comparing the 
resulting sequences at various time points; or by performing single nucleotide polymorphism 

15 analysis and detecting the variation of a single nucleotide in the ESTs at various time points. 

In still yet another embodiment of the present invention, there is provided a method for 
diagnosing a disease in a test subject, comprising the steps of: a) generating a cDNA library for 
the disease from a whole blood sample from a normal subject; b) generating expressed 
sequence tag (EST) profile from the normal subject cDNA library; c) generating a cDNA 
20 library for the disease from a whole blood sample from a test subject; d) generating EST 

profile from the test subject cDNA library; and e) comparing the test subject EST profile to the 
normal subject EST profile, wherein if the test subject EST profile differs from the normal 
subject EST profile, the test subject might be diagnosed with the disease. 

In still yet another embodiment of the present invention, there is provided a kit for 
25 diagnosing, prognosing or predicting a disease, comprising: a) gene-specific primers; wherein 
the primers are designed in such a way that their sequences contain the opposing ends of two 
adjacent exons for the specific gene with the intron sequence excluded; and b) a carrier, 
wherein the carrier immobilizes the primer(s). Preferably, the gene-specific primers are 
selected from the group consisting of insulin-specific primers, atrial natriuretic factor-specific 
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primers, zinc finger protein gene-specific primers, beta-myosin heavy chain gene-specific 
primers, amyloid precursor protein gene-specific primers, and adenomatous polyposis-coli 
protein gene-specific primers. Further preferably, the gene-specific primers are selected from 
the group consisting of SEQ ED Nos. 1 and 2; and SEQ ID Nos. 5 and 6. Such a kit may be 
5 applied to a test subject whole blood sample to diagnose, prognose or predict a disease by 
detecting the quantitative expression levels of specific genes associated with the disease in the 
test subject and then comparing to the levels of same genes expressed in a normal subject. 
Such a kit may also be used for monitoring a course of therapeutic treatment or monitoring the 
onset of overt symptoms of a disease. 

10 In yet another embodiment of the present invention, there is provided a kit for 

diagnosing, prognosing or predicting a disease, comprising: a) probes derived from a whole 
blood sample for a specific disease; and b) a carrier, wherein the carrier immobilizes the 
probes. Such a kit may be applied to a test subject whole blood sample to diagnose, prognose 
or predict a disease by detecting the quantitative expression levels of specific genes associated 

15 with the disease in the test subject and then comparing to the levels of same genes expressed in 
a normal subject. Such a kit may also be used for monitoring a course of therapeutic treatment 
or monitoring the onset of overt symptoms of a disease. 

Furthermore, the present invention provides a cDNA library specific for a disease, 
wherein the cDNA library is generated from whole blood samples. 

20 In one embodiment of the present invention, there is a method of identifying one or more 

genetic markers for a disease, wherein each of said one or more genetic markers corresponds to a 
gene transcript, comprising the steps of: a) determining the level of one or more gene transcripts 
expressed in blood obtained from one or more individuals having a disease, wherein each of said 
one or more transcripts is expressed by a gene that is a candidate marker for disease; and b) 

25 comparing the level of each of said one or more gene transcripts from said step a) with the level 
of each of said one or more genes transcripts in blood obtained from one or more individuals not 
having a disease, wherein those compared transcripts which display differing levels in the 
comparison of step b) are identified as being genetic markers for a disease. 
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In another embodiment of the present invention, there is a method of identifying one or 
more genetic markers for a disease, wherein each of said one or more genetic markers 
corresponds to a gene transcript, comprising the steps of: a) determining the level of one or more 
gene transcripts expressed in blood obtained from one or more individuals having a disease, 
5 wherein each of said one or more transcripts is expressed by a gene that is a candidate marker for 
a disease; and b)comparing the level of each of said one or more gene transcripts from said step 
a) with the level of each of said one or more genes transcripts in blood obtained from one or 
more individuals having a disease, wherein those compared transcripts which display the same 
levels in the comparison of step b) are identified as being genetic markers for a disease. 

10 In another embodiment of the present invention, there is a method of identifying one or 

more genetic markers of a stage of a disease progression or regression, wherein each of said one 
or more genetic markers corresponds to a gene transcript, comprising the steps of: a) determining 
the level of one or more gene transcripts expressed in blood obtained from one or more 
individuals having a stage of a disease, wherein said one or more individuals are at the same 

15 progressive or regressive stage of a disease, and wherein each of said one or more transcripts is 
expressed by a gene that is a candidate marker for determining the stage of progression or 
regression of a disease, and; b) comparing the level of each of said one or more gene transcripts 
from said step a) with the level of each of said one or more genes transcripts in blood obtained 
from one or more individuals who are at a progressive or regressive stage of a disease distinct 

20 from that of said one or more individuals of step a), wherein those compared transcripts which 
display differing levels in the comparison of step b) are identified as being genetic markers for 
the stage of progression or regression of a disease. 

In another embodiment of the present invention, there is a method of identifying one or 
more genetic markers of a stage of a disease progression or regression, wherein each of said one 

25 or more genetic markers corresponds to a gene transcript, comprising the steps of: a) determining 
the level of one or more gene transcripts expressed in blood obtained from one or more 
individuals having a stage of a disease, wherein said one or more individuals are at the same 
progressive or regressive stage of a disease, and wherein each of said one or more transcripts is 
expressed by a gene that is a candidate marker for determining the stage of progression or 

30 regression of a disease, and b) comparing the level of each of said one or more gene transcripts 
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from said step a) with the level of each of said one or more genes transcripts in blood obtained 
from one or more individuals who are at a progressive or regressive stage of a disease identical 
to that of said one or more individuals of step a), wherein those compared transcripts which 
display the same levels in the comparison of step b) are identified as being genetic markers for 
5 the stage of progression or regression of a disease. 

Further embodiments of the methods described in the previous four paragraphs include 
the embodiments wherein each of said one or more markers identifies one or more transcripts of 
one or more non immune response genes, wherein each of said one or more markers identifies a 
transcript of a gene expressed by non-blood tissue, wherein each of said one or more markers 

10 identifies a transcript of a gene expressed by non-lymphoid tissue, wherein said one or more 
markers identifies a sequence selected from the sequences listed in any one of Table 3A-Z and 
Tables 3AA, 3 AB, 3AC and 3 AD, wherein said one or more markers identifies the sequence of 
one or more of the sequences selected from the group consisting of ANF, ZFP and pMyHC, 
wherein said blood comprises a blood sample obtained from said one or more individuals, 

15 wherein said blood sample consists of whole blood, wherein said blood sample consists of a drop 
of blood, and wherein said blood sample consists of blood that has been lysed. 

In another embodiment of the present invention, there is a method of diagnosing or 
prognosing a disease in an individual, comprising the steps of: a) determining the level of one or 
more gene transcripts in blood obtained from said individual suspected of having a disease, and 
20 b) comparing the level of each of said one or more gene transcripts in said blood according to 
step a) with the level of each of said one or more gene transcripts in blood from one or more 
individuals not having a disease, wherein detecting a difference in the levels of each of said one 
or more gene transcripts in the comparison of step b) is indicative of a disease in the individual 
of step a). 

25 In another embodiment of the present invention, there is a method of diagnosing or 

prognosing a disease in an individual, comprising the steps of: a) determining the level of one or 
more gene transcripts in blood obtained from said individual suspected of having a disease, and 
b) comparing the level of each of said one or more gene transcripts in said blood according to 
step a) with the level of each of said one or more gene transcripts in blood from one or more 
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individuals having a disease, wherein detecting the same levels of each of said one or more gene 
transcripts in the comparison of step b) is indicative of a disease in the individual of step a). 

In another embodiment of the present invention, there is a method of determining a stage 
of disease progression or regression in an individual having a disease, comprising the steps of:a) 
determining the level of one or more gene transcripts in blood obtained from said individual 
having a disease, and b) comparing the level of each if said one or more gene transcripts in said 
blood according to step a) with the level of each of said one or more gene transcripts in blood 
obtained from one or more individuals who each have been diagnosed as being at the same 
progressive or regressive stage of a disease, wherein the comparison from step b) allows the 
determination of the stage of a disease progression or regression in an individual. 

In another embodiment of the present invention, there is a method of diagnosing or 
prognosing osteoarthritis in an individual, comprising the steps of: a) determining the level of 
one or more gene transcripts expressed in blood obtained from said individual, wherein said one 
or more gene transcripts correspond to said one or more markers of claim 1 and claim 2, and b) 
comparing the level of each of said one or more gene transcripts in said blood according to step 
a) with the level of each of said one or more gene transcripts in blood from one or more 
individuals having osteoarthritis, c) comparing the level of each of said one or more gene 
transcripts in said blood according to step a) with the level of each of said one or more gene 
transcripts in blood from one or more individuals not having osteoarthritis, d) determining 
whether the level of said one or more gene transcripts of step a) classify with the levels of said 
transcripts in step b) as compared with the levels of said transcripts in step c) wherein said 
determination is indicative of said individual of step a) having osteoarthritis. 

In another embodiment of the present invention, there is a method of determining a stage 
of disease progression or regression in an individual having osteoarthritis, comprising the steps 
of: a) determining the level of one or more gene transcripts expressed in blood obtained from 
said individual having said stage of osteoarthritis, wherein said one or more gene transcripts 
correspond to the markers of claim 3 and claim 4, and b) comparing the level of each of said one 
or more gene transcripts in said blood according to step a) with the level of each of said one or 
more gene transcripts in blood from one or more individuals having said stage of osteoarthritis, 
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c) comparing the level of each of said one or more gene transcripts in said blood according to 
step a) with the level of each of said one or more gene transcripts in blood from one or more 
individuals not having said stage of osteoarthritis, d) determining whether the level of said one or 
more gene transcripts of step a) classify with the levels of said transcripts in step b) as compared 
5 with levels of said transcripts in step c), wherein said determination is indicative of said 
individual of step a) having said stage of osteoarthritis. 

Further embodiments of the methods described in the previous ten paragraphs include 
embodiments comprising a further step of isolating RNA from said blood samples, and 
embodiments comprising determining the level of each of said one or more gene transcripts 

10 comprising quantitative RT-PCR (QRT-PCR), wherein said one or more transcripts are from step 
a) and/or step b) of said methods. Further embodiments of these methods include embodiments 
wherein said QRT-PCR comprises primers which hybridize to one or more transcripts or the 
complement thereof, wherein said one or more transcripts are from step a) and/or step b) of said 
methods, embodiments wherein said primers are 15-25 nucleotides in length, and embodiments 

15 wherein said primers hybridize to one or more of the sequences of any one of Table 3A-Z and 
Tables 3AA, 3AB, 3 AC and 3 AD, or the complement thereof. Further embodiments of the 
methods described in the previous eight paragraphs include embodiments wherein the step of 
determining the level of each of said one or more gene transcripts comprises hybridizing a first 
plurality of isolated nucleic acid molecules that correspond to said one or more transcripts to an 

20 array comprising a second plurality of isolated nucleic acid molecules, wherein in one 

embodiment said first plurality of isolated nucleic acid molecules comprises RNA, DNA, cDNA, 
PCR products or ESTs, wherein in one embodiment said array comprises a plurality of isolated 
nucleic acid molecules comprising RNA, DNA, cDNA, PCR products or ESTs, wherein in one 
embodiment said array comprises two or more of the genetic markers of said methods, wherein 

25 in one embodiment said array comprises a plurality of nucleic acid molecules that correspond to 
genes of the human genome. 

In another embodiment of the present invention, there is a plurality of nucleic acid 
molecules that correspond to two or more sequences from each of any one of Table 3A-Z and 
Tables 3AA, 3AB, 3AC and 3 AD. 



11 



Attorney Docket No.: 4231/2055C 



In another embodiment of the present invention, there is an array which comprises a 
plurality of nucleic acid molecules that correspond to two or more sequences from each of any 
one of Table 3A-Z and Tables 3AA, 3AB, 3 AC and 3 AD. 

In another embodiment of the present invention, there is a kit for diagnosing or 
5 prognosing a disease comprising: a) two gene-specific priming means designed to produce 
double stranded DNA complementary to a gene selected from the group consisting of Table 3L; 
wherein said first priming means contains a sequence which can hybridize to RNA, cDNA or an 
EST complementary to said gene to create an extension product and said second priming means 
capable of hybridizing to said extension product; b) an enzyme with reverse transcriptase activity 
10 c) an enzyme with thermostable DNA polymerase activity and d) a labeling means; wherein said 
primers are used to detect the quantitative expression levels of said gene in a test subject 

In another embodiment of the present invention, there is a kit for monitoring a course of 
therapeutic treatment of a disease, comprising a) two gene-specific priming means designed to 
produce double stranded DNA complementary to a gene selected group consisting of any one of 

15 Table 3A-Z and Tables 3AA, 3AB, 3 AC and 3 AD; wherein said first priming means contains a 
sequence which can hybridize to RNA, cDNA or an EST complementary to said gene to create 
an extension product and said second priming means capable of hybridizing to said extension 
product; b) an enzyme with reverse transcriptase activity c) an enzyme with thermostable DNA 
polymerase activity and d) a labeling means; wherein said primers are used to detect the 

20 quantitative expression levels of said gene in a test subject. 

In another embodiment of the present invention, there is a kit for monitoring progression 
or regressionof a disease, comprising: a) two gene-specific priming means designed to produce 
double stranded DNA complementary to a gene selected group consisting of any one of Table 
3A-Z and Tables 3AA, 3AB, 3AC and 3 AD; wherein said first priming means contains a 
25 sequence which can hybridize to RNA, cDNA or an EST complementary to said gene to create 
an extension product and said second priming means capable of hybridizing to said extension 
product; b) an enzyme with reverse transcriptase activity c) an enzyme with thermostable DNA 
polymerase activity and d) a labeling means; wherein said primers are used to detect the 
quantitative expression levels of said gene in a test subject. 

12 



Attorney Docket No.: 423 1/2055C 



In another embodiment of the present invention, there is a plurality of nucleic acid 
molecules that identify or correspond to two or more sequences from any one of Table 3A-Z and 
Tables 3AA, 3AB, 3AC and 3 AD. 

Other and further aspects, features, and advantages of the present invention will be 
5 apparent from the following description of the presently preferred embodiments of the 
invention. These embodiments are given for the purpose of disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above-recited features, advantages and objects of the invention, as well as others 
10 which will become clear, are attained and can be understood in detail, more particular 
descriptions of the invention briefly summarized above may be had by reference to certain 
embodiments thereof which are illustrated in the appended drawings. These drawings form a 
part of the specification. It is to be noted, however, that the appended drawings illustrate 
preferred embodiments of the invention and therefore are not to be considered limiting in their 
15 scope. 

Figure 1 shows the following RNA samples prepared from human blood; Figure 1A: Lane 1, 
Molecular weight marker; Lane 2, RT-PCR on APP gene; Lane 3, PCR on APP gene; Lane 4, 
RT-PCR on APC gene; Lane 5, PCR on APC gene; Figure IB: Lanes 1 and 2, RT-PCR and 
PCR of pMyHC, respectively; Lanes 3 and 4, RT-PCR of PMyHC from RNA prepared from 
20 human fetal and human adult heart, respectively; Lane 5, Molecular weight marker. 

Figure 2 shows quantitative RT-PCR analysis performed on RNA samples extracted from a 
drop of blood. Forward primer (5'-GCCCTCTGGGGACCTGAC-3\ SEQ ID No. 1) of exon 1 
and reverse primer (5'-CCCACCTGCAGGTCCTCT-3", SEQ ID No. 2) of exons 1 and 2 of 
insulin gene. Blood samples of 4 normal subjects were assayed. Lanes 1, 3, 5 and 7 represent 
25 overnight "fasting" blood sample and lanes 2, 4, 6 and 8 represent "non-fasting" samples. 
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Figure 3 shows quantitative RT-PCR analysis performed on RNA samples extracted from a 
drop of blood. Lanes 1 and 2 represent normal healthy person and lane 3 represents late-onset 
diabetes (Type II) and lane 4 represents asymptomatic diabetes. 

Figure 4 shows multiple RT-PCR assay in a drop of blood. Primers were derived from insulin 
gene (INS), zinc-finger protein gene (ZFP) and house-keeping gene (GADH). Lane 1 
represents normal person. Lane 2 represents late-onset diabetes and lane 3 represents 
asymptomatic diabetes. 

Figure 5 shows standardized levels of insulin gene (Figure 5 A) and ZFP gene (Figure 5B) 
expressed in a drop of blood. The first three subjects were normal, second two subjects 
showed normal glucose tolerance, and the last subject had late onset diabetes type II. Figure 
5C shows standardized levels of insulin gene expressed in each fractionated cell from whole 
blood. 

Figure 6 shows the differential screening of human blood cell cDNA library with different 
cDNA probes of heart and brain tissue. Figure 6 A shows blood cell cDNA probes vs. adult 
heart cDNA probes. Figure 6B shows blood cell cDNA probes vs. human brain cDNA probes. 

Figure 7 graphically shows the 1,800 unique genes in human blood and in the human fetal 
heart grouped into seven cellular functions. 

Figure 8 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals having both osteoarthritis and hypertension as compared with gene expression 
profiles from normal individuals. 

Figure 9 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having both osteoarthritis and who were obese as 
described herein as compared with gene expression profiles from normal individuals 

Figure 10 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having both osteoarthritis and allergies as described 
herein as compared with gene expression profiles from normal individuals. 
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Figure 11 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals having osteoarthritis and who were subject to systemic steroids as described 
herein as compared with gene expression profiles from normal individuals. 

Figure 12 shows a diagrammatic representation of gene expression profiles of blood samples 
5 from individuals having hypertension as compared with gene expression profiles from samples 
of both non-hypertensive and normal individuals. 

Figure 13 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as obese as described herein as compared with gene 
expression profiles from normal and non-obese individuals. 

Figure 14 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having type 2 diabetes as described herein as 
compared with gene expression profiles from normal and non-type 2 diabetes individuals. 

Figure 15 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having hyperlipidemia as described herein as 
compared with gene expression profiles from normal and non-hyperlipidemia patients. 

Figure 16 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having lung disease as described herein as compared 
with gene expression profiles from normal and non lung disease individuals. 

Figure 17 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having bladder cancer as described herein as compared 
with gene expression profiles from non bladder cancer individuals. 

Figure 18 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having advanced stage bladder cancer or early stage 
bladder cancer as described herein as compared with gene expression profiles from non 
25 bladder cancer individuals. 

Figure 19 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having coronary artery disease (CAD) as described 
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herein as compared with gene expression profiles from non-coronary artery disease 
individuals. 

Figure 20 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having rheumatoid arthritis as described herein as 
compared with gene expression profiles from non-rheumatoid arthritis individuals. 

Figure 21 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having depression as described herein as compared 
with gene expression profiles from non-depression individuals. 

Figure 22 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having various stages of osteoarthritis as described 
herein as compared with gene expression profiles from normal individuals. 

Figure 23 shows RT-PCR of overexpressed genes in CAD peripheral blood cells identified using 
microarray experiments, including PBP, PF4 and F13A. 

Figure 24 shows the "Blood Chip", a cDNA microarray slide with 10,368 PCR products derived 
from peripheral blood cell cDNA libraries. Colors represent hybridization to probes labeled with 
Cy3 (green) or Cy5 (red). Yellow spots indicate common hybridization between both probes. In 
slide A, normal blood cell RNA samples were labeled with Cy3 and CAD blood cell RNA 
samples were labeled with Cy5. In slide B, Cy3 and Cy5 were switched to label the RNA 
samples. (Cluster analysis revealed distinct gene expression profiles for normal and CAD 
samples.) 

Figure 25 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having liver cancer as described herein as compared 
with gene expression profiles from normal individuals. 

Figure 26 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having schizophrenia as described herein as compared 
with gene expression profiles from normal individuals. 
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Figure 27 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having symptomatic or asymptomatic chagas disease as 
described herein as compared with gene expression profiles from normal individuals. 

Figure 28 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having asthma and OA as compared with individuals 
having just OA. 

Figure 29 shows a venn diagram illustrating a summary of the analysis comparing hypertension 
and OA patients vs. normal (Table 3A) hypertension and OA patients vs. OA patients (Table 3P) 
and the intersection between the two populations of genes (Table 3Q). 

Figure 30 shows a venn diagram illustrating a summary of the analysis comparing obesity and 
OA patients vs. normal (Table 3B) obesity and OA patients vs. OA patients (Table 3R) and the 
intersection between the two populations of genes (Table 3S). 

Figure 31 shows a venn diagram illustrating a summary of the analysis comparing allergy and 
OA patients vs. normal (Table 3C) allergy and OA patients vs. OA patients (Table 3T) and the 
intersection between the two populations of genes (Table 3U). 

Figure 32 shows a venn diagram illustrating a summary of the analysis comparing systemic 
steroids and OA patients vs. normal (Table 3D) systemic steroids and OA patients vs. OA 
patients (Table 3 V) and the intersection between the two populations of genes (Table 3W). 

Figure 33 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having Manic Depression as compared with those 
individuals who have Schizophrenia. 

Figure 34 shows a diagrammatic representation of gene expression profiles of blood samples 
from individuals who were identified as having OA and being one form of systemic steroids. 

Figure 35 shows a representation of the presentation of various stages of OA in patients of with 
respect to the age group of the patients. 
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DETAILED DESCRIPTION 

In accordance with the present invention, there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, 
"Molecular Cloning: A Laboratory Manual (1982); "DNA Cloning: A Practical Approach," 
Volumes I and II (D.N. Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 
"Nucleic Acid Hybridization" [B.D. Hames & SJ. Higgins eds. (1985)]; "Transcription and 
Translation" [B.D. Hames & SJ. Higgins eds. (1984)]; "Animal Cell Culture" [R.I. Freshney, 
ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A Practical 
Guide To Molecular Cloning" (1984). Therefore, if appearing herein, the following terms shall 
have the definitions set out below. 

A "cDNA" is defined as copy-DNA or complementary-DNA, and is a product of a 
reverse transcription reaction from an mRNA transcript. "RT-PCR" refers to reverse 
transcription polymerase chain reaction and results in production of cDNAs that are 
complementary to the mRNA template(s). 

In addition to RT-PCR, other methods of amplifying may also be used for the purpose 
of measuring/quantitating tissue-specific transcripts in human blood. For example, mass 
spectrometry may be used to quantify the transcripts (Koster et al., 1996; Fu et al., 1998). The 
application of presently disclosed method for detecting tissue-specific transcripts in blood does 
not restrict to subjects undergoing course of therapy or treatment, it may also be used for 
monitoring a patient for the onset of overt symptoms of a disease. Furthermore, the present 
method may be used for detecting any gene transcripts in blood. A kit for diagnosing, 
prognosing or even predicting a disease may be designed using gene-specific primers or probes 
derived from a whole blood sample for a specific disease and applied directly to a drop of 
blood. A cDNA library specific for a disease may be generated from whole blood samples and 
used for diagnosis, prognosis or even predicting a disease. 

The term "oligonucleotide" is defined as a molecule comprised of two or more 
deoxyribonucleotides and/ or ribonucleotides, preferably more than three. Its exact size will 
depend upon many factors which, in turn, depend upon the ultimate function and use of the 
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oligonucleotide. The upper limit may be 15, 20, 25, 30, 40 or 50 nucleotides in length. The 
term "primer" as used herein refers to an oligonucleotide, whether occurring naturally as in a 
purified restriction digest or produced synthetically, which is capable of acting as a point of 
initiation of synthesis when placed under conditions in which synthesis of a primer extension 
5 product, which is complementary to a nucleic acid strand, is induced, i.e., in the presence of 
nucleotides and an inducing agent such as a DNA polymerase and at a suitable temperature and 
pH. The primer may be either single-stranded or double-stranded and must be sufficiently long 
to prime the synthesis of the desired extension product in the presence of the inducing agent. 
The exact length of the primer will depend upon many factors, including temperature, source 
10 of primer and the method used. For example, for diagnostic applications, depending on the 
complexity of the target sequence, the oligonucleotide primer typically contains 15-25 or more 
nucleotides, although it may contain fewer nucleotides. The factors involved in determining 
the appropriate length of primer are readily known to one of ordinary skill in the art. 

As used herein, random sequence primers refer to a composition of primers of random 
15 sequence, i.e. not directed towards a specific sequence. These sequences possess sufficient 
complementary to hybridize with a polynucleotide and the primer sequence need not reflect the 
exact sequence of the template. 

"Restriction fragment length polymorphism" refers to variations in DNA sequence 
detected by variations in the length of DNA fragments generated by restriction endonuclease 
20 digestion. 

A standard Northern blot assay can be used to ascertain the relative amounts of mRNA 
in a cell or tissue obtained from plant or other tissue, in accordance with conventional Northern 
hybridization techniques known to those persons of ordinary skill in the art. The Northern blot 
uses a hybridization probe, e.g. radiolabeled cDNA, either containing the full-length, single 
25 stranded DNA or a fragment of that DNA sequence at least 20 (preferably at least 30, more 
preferably at least 50, and most preferably at least 100 consecutive nucleotides in length). The 
DNA hybridization probe can be labeled by any of the many different methods known to those 
skilled in this art. The labels most commonly employed for these studies are radioactive 
elements, enzymes, chemicals which fluoresce when exposed to ultraviolet light, and others. A 
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number of fluorescent materials are known and can be utilized as labels. These include, for 
example, fluorescein, rhodamine, auramine, Texas Red, AMCA blue and Lucifer Yellow. A 
particular detecting material is anti-rabbit antibody prepared in goats and conjugated with 
fluorescein through an isothiocyanate. Proteins can also be labeled with a radioactive element 

5 or with an enzyme. The radioactive label can be detected by any of the currently available 
counting procedures. The preferred isotope may be selected from 3 H, 14 C, 32 P, 35 S, 36 C1, 5l Cr, 
57 Co, 58 Co, 59 Fe, 90 Y, 125 1, 131 I, and 186 Re. Enzyme labels are likewise useful, and can be 
detected by any of the presently utilized colorimetric, spectrophotometric, 
fluorospectrophotometric, amperometric or gasometric techniques. The enzyme is conjugated 

10 to the selected particle by reaction with bridging molecules such as carbodiimides, 
diisocyanates, glutaraldehyde and the like. Many enzymes which can be used in these 
procedures are known and can be utilized. The preferred are peroxidase, P-glucuronidase, 
glucosidase, P-D-galactosidase, urease, glucose oxidase plus peroxidase and alkaline 
phosphatase. U.S. Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to by way of 

15 example for their disclosure of alternate labeling material and methods. 

As used herein, "individual" refers to human subjects as well as non-human subjects. 
The examples herein are not meant to limit the methodology of the present invention to human 
subjects only, as the instant methodology is useful in the fields of veterinary medicine, animal 
sciences and such. The term "individual" refers to human subjects and non-human subjects 
20 who are disease or condition free and also includes human and non-human subjects diagnosed 
with one or more diseases or conditions, as defined herein. "Co-morbid individuals" or 
"comorbidity" or "individuals considered as co-morbid" are individuals who have more than 
one disease or condition as defined herein. For example a patient diagnosed with both 
osteoarthritis and hypertension is considered to present with comorbidities. 

25 As used herein, "detecting" refers to determining the presence of a gene expression 

product, for example cDNA, RNA or EST, by any method known to those of skill in the art or 
taught in numerous texts and laboratory manuals (see for example, Ausubel et al. Short 
Protocols in Molecular Biology (1995) 3rd Ed. John Wiley & Sons, Inc.). For example, 
methods of detection include but are not limited to, RNA fingerprinting, Northern blotting, 

30 polymerase chain reaction, ligase chain reaction, Qbeta replicase, isothermal amplification 
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method, strand displacement amplification, transcription based amplification systems, nuclease 
protection (SI nuclease or RNAse protection assays) as well as methods disclosed in 
WO88/10315, WO89/06700, PCT/US87/00880, PCT/US 89/0 1025. 

As used herein, a disease of the invention includes, but is not limited to, blood disorder, 

5 blood lipid disease, autoimmune disease, arthritis (including osteoarthritis, rheumatoid arthritis, 
lupus, allergies, juvenile rheumatoid arthritis and the like), bone or joint disorder, a 
cardiovascular disorder (including heart failure, congenital heart disease; rheumatic fever, 
valvular heart disease; corpulmonale, cardiomyopathy, myocarditis, pericardial disease; vascular 
diseases such as atherosclerosis, acute myocardial infarction, ischemic heart disease and the 

10 like), obesity, respiratory disease (including asthma, pneumonitis, pneumonia, pulmonary 
infections, lung disease, bronchiectasis, tuberculosis, cystic fibrosis, interstitial lung disease, 
chronic bronchitis emphysema, pulmonary hypertension, pulmonary thromboembolism, acute 
respiratory distress syndrome and the like), hyperlipidemias, endocrine disorder, immune 
disorder, infectious disease, muscle wasting and whole body wasting disorder, neurological 

15 disorders (including migraines, seizures, epilepsy, cerebrovascular diseases, alzheimers, 

dementia, Parkinson's, ataxic disorders, motor neuron diseases, cranial nerve disorders, spinal 
cord disorders, meningitis and the like) including neurodegenerative and/or neuropsychiatry 
diseases and mood disorders (including schizophrenia, anxiety, bipolar disorder; manic 
depression and the like, skin disorder, kidney disease, scleroderma, stroke, hereditary 

20 hemorrhage telangiectasia, diabetes, disorders associated with diabetes (e.g., PVD), 

hypertension, Gaucher's disease, cystic fibrosis, sickle cell anemia, liver disease, pancreatic 
disease, eye, ear, nose and/or throat disease, diseases affecting the reproductive organs, 
gastrointestinal diseases (including diseases of the colon, diseases of the spleen, appendix, gall 
bladder, and others) and the like. For further discussion of human diseases, see Mendelian 

25 Inheritance in Man: A Catalog of Human Genes and Genetic Disorders by Victor A. McKusick 
(12th Edition (3 volume set) June 1998, Johns Hopkins University Press, ISBN: 0801857422) 
and Harrison's Principles of Internal Medicine by Braunwald, Fauci, Kasper, Hauser, Longo, & 
Jameson (15th Edition 2001), the entirety of which is incorporated herein. 

In another embodiment of the invention, a disease refers to an immune disorder, such as 
30 those associated with overexpression of a gene or expression of a mutant gene (e.g., 
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autoimmune diseases, such as diabetes mellitus, arthritis (including rheumatoid arthritis, 
juvenile rheumatoid arthritis, osteoarthritis, psoriatic arthritis), multiple sclerosis, 
encephalomyelitis, myasthenia gravis, systemic lupus erythematosis, automimmune thyroiditis, 
dermatitis (including atopic dermatitis and eczematous dermatitis), psoriasis, Sjogren's 
5 Syndrome, Crohn's disease, aphthous ulcer, iritis, conjunctivitis, keratoconjunctivitis, 
ulcerative colitis, asthma, allergic asthma, cutaneous lupus erythematosus, scleroderma, 
vaginitis, proctitis, drug eruptions, leprosy reversal reactions, erythema nodosum leprosum, 
autoimmune uveitis, allergic encephalomyelitis, acute necrotizing hemorrhagic 
encephalopathy, idiopathic bilateral progressive sensorineural hearing, loss, aplastic anemia, 
10 pure red cell anemia, idiopathic thrombocytopenia, polychondritis, Wegener's granulomatosis, 
chronic active hepatitis, Stevens-Johnson syndrome, idiopathic sprue, lichen planus, Graves' 
disease, sarcoidosis, primary biliary cirrhosis, uveitis posterior, and interstitial lung fibrosis), 
graft- versus-host disease, cases of transplantation, and allergy. 

In another embodiment, a disease of the invention is a cellular proliferative and/or 

15 differentiative disorder that includes, but is not limited to, cancer e.g., carcinoma, sarcoma or 
other metastatic disorders and the like. As used herein, the term "cancer" refers to cells having 
the capacity for autonomous growth, i.e., an abnormal state of condition characterized by 
rapidly proliferating cell growth. "Cancer" is meant to include all types of cancerous growths 
or oncogenic processes, metastatic tissues or malignantly transformed cells, tissues, or organs, 

20 irrespective of histopathologic type or stage of invasiveness. Examples of cancers include but 
are nor limited to solid tumors and leukemias, including: apudoma, choristoma, branchioma, 
malignant carcinoid syndrome, carcinoid heart disease, carcinoma (e.g., Walker, basal cell, 
basosquamous, Brown-Pearce, ductal, Ehrlich tumour, in situ, Krebs 2, Merkel cell, mucinous, 
non-small cell lung, oat cell, papillary, scirrhous, bronchiolar, bronchogenic, squamous cell, 

25 and transitional cell), histiocytic disorders, leukaemia (e.g., B cell, mixed cell, null cell, T cell, 
T-cell chronic, HTLV-II-associated, lymphocytic acute, lymphocytic chronic, mast cell, and 
myeloid), histiocytosis malignant, Hodgkin disease, immunoproliferative small, non-Hodgkin 
lymphoma, plasmacytoma, reticuloendotheliosis, melanoma, chondroblastoma, chondroma, 
chondrosarcoma, fibroma, fibrosarcoma, giant cell tumors, histiocytoma, lipoma, liposarcoma, 

30 mesothelioma, myxoma, myxosarcoma, osteoma, osteosarcoma, Ewing sarcoma, synovioma, 
adenofibroma, adenolymphoma, carcinosarcoma, chordoma, craniopharyngioma, 
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dysgerminoma, hamartoma, mesenchymoma, mesonephroma, myosarcoma, ameloblastoma, 
cementoma, odontoma, teratoma, thymoma, trophoblastic tumour, adeno-carcinoma, adenoma, 
cholangioma, cholesteatoma, cylindroma, cystadenocarcinoma, cystadenoma, granulosa cell 
tumour, gynandroblastoma, hepatoma, hidradenoma, islet cell tumour, Leydig cell tumour, 
papilloma, Sertoli cell tumour, theca cell tumour, leiomyoma, leiomyosarcoma, myoblastoma, 
mymoma, myosarcoma, rhabdomyoma, rhabdomyosarcoma, ependymoma, ganglioneuroma, 
glioma, medulloblastoma, meningioma, neurilemmoma, neuroblastoma, neuroepithelioma, 
neurofibroma, neuroma, paraganglioma, paraganglioma nonchromaffin, angiokeratoma, 
angiolymphoid hyperplasia with eosinophilia, angioma sclerosing, angiomatosis, 
glomangioma, hemangioendothelioma, hemangioma, hemangiopericytoma, hemangiosarcoma, 
lymphangioma, lymphangiomyoma, lymphangiosarcoma, pinealoma, carcinosarcoma, 
chondrosarcoma, cystosarcoma, phyllodes, fibrosarcoma, hemangiosarcoma, leimyosarcoma, 
leukosarcoma, liposarcoma, lymphangiosarcoma, myosarcoma, myxosarcoma, ovarian 
carcinoma, rhabdomyosarcoma, sarcoma (e.g., Ewing, experimental, Kaposi, and mast cell), 
neoplasms (e.g., bone, breast, digestive system, colorectal, liver, pancreatic, pituitary, 
testicular, orbital, head and neck, central nervous system, acoustic, pelvic respiratory tract, and 
urogenital), neurofibromatosis, and cervical dysplasia, and other conditions in which cells have 
become immortalized or transformed. 

In another embodiment, a disease of the invention includes but is not limited to a 
condition wherein said condition is reflective of the state of a particular individual, whether 
said state is a physical, emotional or psychological state, said state resulting from the 
progression of time, treatment, environmental factors or genetic factors. 

As used herein, a gene of the invention is a gene that is expressed in blood and is either 
upregulated, or downregulated and can be used, either solely or in conjunction with other genes, 
as a marker for disease as defined herein. By a gene that is expressed in blood or in a blood 
sample is meant a gene that is expressed in the cells which typically make up blood including 
monocytes, leukocytes, lymphocytes and erythrocytes, all other cells derived directly from 
haemopoietic or mesenchymal stem cells, or derived directly from a cell which typically makes 
up the blood. 
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The term "gene" includes a region that can be transcribed into RNA, as the invention 
contemplates detection of RNA or equivalents thereof, i.e., cDNA or EST. A gene of the 
invention includes but is not limited to genes specific for or involved in a particular biological 
process, such as apoptosis, differentiation, stress response, aging, proliferation, etc.; cellular 
5 mechanism genes, e.g. cell-cycle, signal transduction, metabolism of toxic compounds, and the 
like; disease associated genes, e.g. genes involved in cancer, schizophrenia, diabetes, high 
blood pressure, atherosclerosis, viral-host interaction and infection and the like. 

For example, the gene of the invention can be an oncogene (Hanahan, D. and R.A. 
Weinberg, Cell (2000) 100:57; and Yokota, J., Carcinogenesis (2000) 21(3):497-503) whose 

10 expression within a cell induces that cell to become converted from a normal cell into a tumor 
cell. Further examples of genes of the invention include, but are not limited to, cytokine genes 
(Rubinstein, M., et al., Cytokine Growth Factor Rev. (1998) 9(2):175-81); idiotype (Id) protein 
genes (Benezra, R, et al., Oncogene (2001) 20(58):8334-41; Norton, J.D., J. Cell Sci. (2000) 
1 13(22):3897-905); prion genes (Prusiner, S.B., et al., Cell (1998) 93(3):337-48; Safar, J., and 

15 SB. Prusiner, Prog. Brain Res. (1998) 1 17:421-34); genes that express molecules that induce 
angiogenesis (Gould, V.E. and B.M. Wagner, Hum. Pathol. (2002) 33(1 1):1061-3); genes 
encoding adhesion molecules (Chothia, C. and E.Y. Jones, Annu. Rev. Biochem. (1997) 
66:823-62; Parise, L.V., et al., Semin. Cancer Biol. (2000) 10(6):407-14); genes encoding cell 
surface receptors (Deller, M.C., and Y.E. Jones, Curr. Opin. Struct. Biol. (2000) 10(2):213-9); 

20 genes of proteins that are involved in metastasizing and/or invasive processes (Boyd, D., 

Cancer Metastasis Rev. (1996) 15(l):77-89; Yokota, J., Carcinogenesis (2000) 21(3):497-503); 
genes of proteases as well as of molecules that regulate apoptosis and the cell cycle (Matrisian, 
L.M., Curr. Biol. (1999) 9(20):R776-8; Krepela, E., Neoplasma (2001) 48(5):332-49; Basbaum 
and Werb, Curr. Opin. Cell Biol. (1996) 8:731-738; Birkedal-Hansen, et al., Crit. Rev. Oral 

25 Biol. Med. (1993) 4:197-250; Mignatti and Rifkin, Physiol. Rev. (1993) 73:161-195; Stetler- 
Stevenson, et al., Annu. Rev. Cell Biol. (1993) 9:541-573; Brinkerhoff, E., and L.M. Matrisan, 
Nature Reviews (2002) 3:207-214; Strasser, A., et al., Annu. Rev. Biochem. (2000) 69:217-45; 
Chao, D.T. and S.J. Korsmeyer, Annu. Rev. Immunol. (1998) 16:395-419; Mullauer, L., et al., 
Mutat. Res. (2001) 488(3):21 1-31; Fotedar, R., et al., Prog. Cell Cycle Res. (1996) 2:147-63; 

30 Reed, J.C., Am. J. Pathol. (2000) 157(5):1415-30; D'Ari, R, Bioassays (2001) 23(7):563-5); or 
multi-drug resistance genes, suach as MDR1 gene (Childs, S., and V. Ling, Imp. Adv. Oncol. 
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(1994) 21-36). In another embodiment, a gene of the invention contains a sequence found in 
Tables 2 or 3 or Figures 22 - 34. In another embodiment, a gene of the invention can be an 
immune response gene or a non-immune response gene. By an immune response gene is 
meant a primary defense response gene located outside the major histocompatibility region 
5 (MHC) that is initially triggered in response to a foreign antigen to regulate immune 

responsiveness. All other genes expressed in blood are considered to be non-immune response 
gene. For example, an immune response gene would be understood by a person skilled in the 
art to include: cytokines including interleukins and interferons such as TNF- alpha, IL-10, IL- 
12, IL-2, IL-4, IL-10, IL-12, IL-13, TGF-Beta, IFN-gamma; immunoglobulins, complement 
10 and the like (see for example Bellardelli, F. Role of interferons and other cytokines in the 
regulation of the immune response APMIS. 1995 Mar; 103(3): 161-79; ); . 

Construction of a Microarrav 

A nucleic acid microarray (RNA, DNA, cDNA, PCR products or ESTs) according to the 
15 invention was constructed as follows: 

Nucleic acids (RNA, DNA, cDNA, PCR products or ESTs) (~40iil) are precipitated with 
4|jl (1/10 volume) of 3M sodium acetate (pH 5.2) and 100 ul (2.5 volumes) of ethanol and stored 
overnight at -20°C. They are then centrifuged at 3,300 rpm at 4°C for 1 hour. The obtained 
pellets were washed with 50|al ice-cold 70% ethanol and centrifuged again for 30 minutes. The 

20 pellets are then air-dried and resuspended well in 50% dimethylsulfoxide (DMSO) or 20|il 3X 
SSC overnight. The samples are then deposited either singly or in duplicate onto Gamma Amino 
Propyl Silane (Corning CMT-GAPS or CMT-GAP2, Catalog No. 40003, 40004) or polylysine- 
coated slides (Sigma Cat. No. P0425) using a robotic GMS 417 or 427 arrayer (Affymetrix, CA). 
The boundaries of the DNA spots on the microarray are marked with a diamond scriber. The 

25 invention provides for arrays where 10-20,000 different DNAs are spotted onto a solid support to 
prepare an array, and also may include duplicate or triplicate DNAs. 

The arrays are rehydrated by suspending the slides over a dish of warm particle free 
ddH20 for approximately one minute (the spots will swell slightly but not run into each other) 
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and snap-dried on a 70-80°C inverted heating block for 3 seconds. DNA is then UV crosslinked 
to the slide (Stratagene, Stratalinker, 65mJ - set display to "650" which is 650 x lOO^J) or baked 
at 80°C for two to four hours. The arrays are placed in a slide rack. An empty slide chamber is 
prepared and filled with the following solution: 3.0 grams of succinic anhydride (Aldrich) is 

5 dissolved in 189ml of l-methyl-2-pyrrolidinone (rapid addition of reagent is crucial); 

immediately after the last flake of succinic anhydride dissolved, 21.0ml of 0.2 M sodium borate 
is mixed in and the solution is poured into the slide chamber. The slide rack is plunged rapidly 
and evenly in the slide chamber and vigorously shaken up and down for a few seconds, making 
sure the slides never leave the solution, and then mixed on an orbital shaker for 15-20 minutes. 

10 The slide rack is then gently plunged in 95°C ddH 2 0 for 2 minutes, followed by plunging five 
times in 95% ethanol. The slides are then air dried by allowing excess ethanol to drip onto paper 
towels. The arrays are then stored in the slide box at room temperature until use. 

Nucleic acid Microarrays 

15 Any combination of the nucleic acid sequences generated from nucleotides 

complimentary to regions of DNA expressed in blood are used for the construction of a 
microarray. In one embodiment, the microarray is chondrocyte-specific and encompasses genes 
which are important in the osteoarthritis disease process. A microarray according to the 
invention preferably comprises between 10, 100, 500, 1000, 5000, 10,000 and 15,000 nucleic 

20 acid members, and more preferably comprises at least 5000 nucleic acid members. The nucleic 
acid members are known or novel nucleic acid sequences described herein, or any combination 
thereof. A microarray according to the invention is used to assay for differential gene expression 
profiles of genes in blood samples from healthy patients as compared to patients with a disease. 

25 Microarray Used According to the invention 

The Human Genome U133 (HG-U133) Set, consisting of two GeneChip® arrays, 
contains almost 45,000 probe sets representing more than 39,000 transcripts derived from 
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approximately 33,000 well-substantiated human genes. This set design uses sequences selected 
from GenBank®, dbEST, and RefSeq. 

The sequence clusters were created from the UniGene database (Build 133, April 20, 
2001). They were then refined by analysis and comparison with a number of other publicly 
available databases including the Washington University EST trace repository and the University 
of California, Santa Cruz Golden Path human genome database (April 2001 release). 

The HG-U133A Array includes representation of the RefSeq database sequences and 
probe sets related to sequences previously represented on the Human Genome U95 Av2 Array. 
The HG-U133B Array contains primarily probe sets representing EST clusters. 

15 K ChondroChip™ - The ChondroChip™ is chondrocyte-specific microarray chip 
comprising 15,000 novel and known EST sequences of the chondrocyte from human 
chondrocyte-specific cDNA libraries. 

Controls on the ChondroChip™ - There are two types of controls used on microarrays. First, 
positive controls are genes whose expression level is invariant between different stages of 
investigation and are used to monitor: 

a) target DNA binding to the slide, 

b) quality of the spotting and binding processes of the target DNA onto the slide, 

c) quality of the RNA samples, and 

d) efficiency of the reverse transcription and fluorescent labeling of the probes. 
Second, negative controls are external controls derived from an organism unrelated to 

and therefore unlikely to cross-hybridize with the sample of interest. These are used to monitor 
for: 

a) variation in background fluorescence on the slide, and 

b) non-specific hybridization. 
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There are currently 63 control spots on the ChondroChip™ consisting of: 

Type Na 

Positive Controls: 2 
Alien DNA 12 
5 A. thaliana DNA 10 
Spotting Buffer 41 



BloodChip™ - The "BloodChip™" is a cDNA microarray slide with 10,368 PCR products 
derived from peripheral blood cell cDNA libraries as shown in Figure 24. 

10 Tareet Nucleic acid Preparation and Hybridization 

Preparation of Fluorescent DNA Probe from mRNA 

Fluorescently labeled target nucleic acid samples are prepared for analysis with an array 
of the invention. 

2|ig Oligo-dT primers are annealed to 2jxg of mRNA isolated from a blood sample of a 
15 . patient in a total volume of 15|il, by heating to 70°C for 10 min, and cooled on ice. The mRNA 
is reverse transcribed by incubating the sample at 42°C for 1 .5-2 hours in a 100 jil volume 
containing a final concentration of 50mM Tris-HCl (pH 8.3), 75mM KC1, 3mM MgCl 2 , 25mM 
DTT, 25mM unlabeled dNTPs, 400 units of Superscript II (200U/|iL, Gibco BRL), and 15mM of 
Cy3 or Cy5 (Amersham). RNA is then degraded by addition of 15^1 of 0.1N NaOH, and 
20 incubation at 70°C for 1 0 min. The reaction mixture is neutralized by addition of 1 5\i\ of 0. IN 
HC1, and the volume is brought to 500|il with TE (lOmM Tris, ImM EDTA), and 20 jag of Cotl 
human DNA (Gibco-BRL) is added. 

The labeled target nucleic acid sample is purified by centrifugation in a Centricon-30 
micro-concentrator (Amicon). If two different target nucleic acid samples (e.g., two samples 
25 derived from a healthy patient vs. patient with a disease) are being analyzed and compared by 
hybridization to the same array, each target nucleic acid sample is labeled with a different 
fluorescent label (e.g., Cy3 and Cy5) and separately concentrated. The separately concentrated 
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target nucleic acid samples (Cy3 and Cy5 labeled) are combined into a fresh centricon, washed 
with 500|il TE, and concentrated again to a volume of less than 7(al. \\xL of lOjig/jxl polyA RNA 
(Sigma, #P9403) and ljil of lOjig/fil tRNA (Gibco-BRL, #15401-01 1) is added and the volume is 
adjusted to 9.5|il with distilled water. For final target nucleic acid preparation 2.1|il 20XSSC 
5 (1.5M NaCl, 150mM NaCitrate (pH8.0)) and 0.35^1 10%SDS is added. 

Hybridization 

Labeled nucleic acid is denatured by heating for 2 min at 100°C, and incubated at 37°C 
for 20-30 min before being placed on a nucleic acid array under a 22mm x 22mm glass cover 
slip. Hybridization is carried out at 65°C for 14 to 18 hours in a custom slide chamber with 
10 humidity maintained by a small reservoir of 3XSSC The array is washed by submersion and 
agitation for 2-5 min in 2X SSC with 0.1%SDS, followed by IX SSC, and 0.1X SSC. Finally, 
the array is dried by centrifugation for 2 min in a slide rack in a Beckman GS-6 tabletop 
centrifuge in Microplus carriers at 650 RPM for 2 min. 

Signal Detection And Data Generation 

15 Following hybridization of an array with one or more labeled target nucleic acid samples, 

arrays are scanned immediately using a GMS Scanner 418 and Scanalyzer software (Michael 
Eisen, Stanford University), followed by GeneSpring™ software (Silicon Genetics, CA) 
analysis. Alternatively, a GMS Scanner 428 and Jaguar software may be used followed by 
GeneSpring™ software analysis. 

20 If one target nucleic acid sample is analyzed, the sample is labeled with one fluorescent 

dye (e.g., Cy3 or Cy5). 

After hybridization to a microarray as described herein, fluorescence intensities at the associated 
nucleic acid members on the microarray are determined from images taken with a custom 
confocal microscope equipped with laser excitation sources and interference filters appropriate 
25 for the Cy3 or Cy5 fluorescence. 

The presence of Cy3 or Cy5 fluorescent dye on the microarray indicates hybridization of 
a target nucleic acid and a specific nucleic acid member on the microarray. The intensity of Cy3 
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or Cy5 fluorescence represents the amount of target nucleic acid which is hybridized to the 
nucleic acid member on the microarray, and is indicative of the expression level of the specific 
nucleic acid member sequence in the target sample. 

After hybridization, fluorescence intensities at the associated nucleic acid members on 
5 the microarray are determined from images taken with a custom confocal microscope equipped 
with laser excitation sources and interference filters appropriate for the Cy3 and Cy5 fluors. 
Separate scans are taken for each fluor at a resolution of 225 (im 2 per pixel and 65,536 gray 
levels. Normalization between the images is used to adjust for the different efficiencies in 
labeling and detection with the two different fluors. This is achieved by manual matching of the 
10 detection sensitivities to bring a set of internal control genes to nearly equal intensity followed 
by computational calculation of the residual scalar required for optimal intensity matching for 
this set of genes. 

The presence of Cy3 or Cy5 fluorescent dye on the microarray indicates hybridization of 
a target nucleic acid and a specific nucleic acid member on the microarray. The intensities of 

15 Cy3 or Cy5 fluorescence represent the amount of target nucleic acid which is hybridized to the 
nucleic acid member on the microarray, and is indicative of the expression level of the specific 
nucleic acid member sequence in the target sample. If a nucleic acid member on the array shows 
no color, it indicates that the gene in that element is not expressed in either sample. If a nucleic 
acid member on the array shows a single color, it indicates that a labeled gene is expressed only 

20 in that cell sample. The appearance of both colors indicates that the gene is expressed in both 
tissue samples. The ratios of Cy3 and Cy5 fluorescence intensities, after normalization, are 
indicative of differences of expression levels of the associated nucleic acid member sequence in 
the two samples for comparison. A ratio of expression not equal to is used as an indication of 
differential gene expression. 

25 The array is scanned in the Cy 3 and Cy5 channels and stored as separate 16-bit TIFF 

images. The images are incorporated and analyzed using Scanalyzer software which includes a 
gridding process to capture the hybridization intensity data from each spot on the array. The 
fluorescence intensity and background-subtracted hybridization intensity of each spot is collected 
and a ratio of measured mean intensities of Cy5 to Cy3 is calculated. A liner regression 
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approach is used for normalization and assumes that a scatter plot of the measured Cy5 versus 
Cy3 intensities should have a scope of one. The average of the ratios is calculated and used to 
rescale the data and adjust the slope to one. A post-normalization cutoff of a ratio not equal to 
1.0-is used to identify differentially expressed genes. 

5 When comparing two or more samples for differences, results are reported as statistically 

significant when there is only a small probability that similar results would have been observed if 
the tested hypothesis (i.e., the genes are not expressed at different levels) were true. A small 
probability can be defined as the accepted threshold level at which the results being compared 
are considered significantly different. The accepted lower threshold is set at, but not limited to, 
10 0.05 (i.e., there is a 5% likelihood that the results would be observed between two or more 
identical populations) such that any values determined by statistical means at or below this 
threshold are considered significant. 

When comparing two or more samples for similarities, results are reported as statistically 
significant when there is only a small probability that similar results would have been observed if 

15 the tested hypothesis (i.e., the genes are not expressed at different levels) were true. A small 
probability can be defined as the accepted threshold level at which the results being compared 
are considered significantly different. The accepted lower threshold is set at, but not limited to, 
0.05 (i.e., there is a 5% likelihood that the results would be observed between two or more 
identical populations) such that any values determined by statistical means above this threshold 

20 are not considered significantly different and thus similar. 

Identification of genes differentially expressed in blood samples from patients with 
disease as compared to healthy patients or as compared to patients without said disease is 
determined by statistical analysis of the gene expression profiles from healthy patients or patients 
without disease compared to patients with disease using the Wilcox Mann Whitney rank sum 
25 test. Other statistical tests can also be used, see for example (Sokal and Rohlf (1987) 

Introduction to Biostatistics 2 nd edition, WH Freeman, New York), which is incorporated herein 
in their entirety. 

In order to facilitate ready access, e.g. for comparison, review, recovery and/or 
modification, the expression profiles of patients with disease and/or patients without disease or 
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healthy patients can be recorded in a database, whether in a relational database accessible by a 
computational device or other format, or a manually accessible indexed file of profiles as 
photographs, analogue or digital imaging, readouts spreadsheets etc. Typically the database is 
compiled and maintained at a central facility, with access being available locally and/or 
5 remotely. 

As would be understood by a person skilled in the art, comparison as between the 
expression profile of a test patient with expression profiles of patients with a disease, expression 
profiles of patients with a certain stage or degree of progression of said disease, without said 
disease, or a healthy patient so as to diagnose or prognose said test patient can occur via 
10 expression profiles generated concurrently or non concurrently. It would be understood that 
expression profiles can be stored in a database to allow said comparison. 

As additional test samples from test patients are obtained, through clinical trials, further 
investigation, or the like, additional data can be determined in accordance with the methods 
disclosed herein and can likewise be added to a database to provide better reference data for 
15 comparison of healthy and/or non-disease patients and/or certain stage or degree of progression 
of a disease as compared with the test patient sample. 

Use of Expression Profiles for Diagnostic Purposes 

As would be understood to a person skilled in the art, one can utilize sets of genes which 
have been identified as statistically significant as described above in order to characterize an 
20 unknown sample as having said disease or not having said disease. This is commonly termed 
"class prediction". 

Methods that can be used for class prediction analysis have been well described and 
generally involve a training phase using samples with known classification and a testing phase 
from which the algorithm generalizes from the training data so as to predict classification of 
25 unknown samples (see for Example Slonim, D. (2002), Nature Genetics Supp. Vol 32 502-8, 
Raychaudhuri et al. (2001) Trends Biotechnol 19: 189-193; Khan et al. (2001) Nature Med. 7 
673-9.; Golub et al. (1999) Science 286: 531-7. Hastie et al. (2000) Genome Biol. 1(2) Research 
0003.1-0003.21 all of which are incorporated herein by reference in their entirety ). 
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As additional samples are obtained, for example during clinical trials, their expression 
profiles can be determined and correlated with the relevant subject data in the database and 
likewise be recorded in said database. Algorithms as described above can be used to query 
additional samples against the existing database to further refine the diagnostic and/or prognostic 
5 determination by allowing an even greater association between the disease and gene expression 
signature. 

The diagnosing or prognosing may thus be performed by detecting the expression level of 
two or more genes, three or more genes, four or more genes, five or more genes, six or more 
genes, seven or more genes, eight or more genes, nine or more genes, ten or more genes, fifteen 
10 or more genes, twenty or more genes thirty or more genes, fifty or more genes, one hundred or 
more genes, two hundred or more genes, three hundred or more genes, five hundred or more 
genes or all of the genes disclosed for the specific disease in question. 

Data Acquisition and Analysis of differentially expressed EST Sequences 

The differentially expressed EST sequences are then searched against available 
15 databases, including the "nt", "nr", "est", "gss" and "htg" databases available through NCBI to 
determine putative identities for ESTs matching to known genes or other ESTs. Functional 
characterisation of ESTs with known gene matches are made according to any known method. 
Preferably, differentially expressed EST sequences are compared to the non-redundant 
Genbank/EMBL/DDBJ and dbEST databases using the BLAST algorithm (Altschul SF, Gish W, 
20 Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol 1 990;2 15:403- 
10). A minimum value of P =10~ 10 and nucleotide sequence identity >95%, where the sequence 
identity is non-contiguous or scattered, are required for assignments of putative identities for 
ESTs matching to known genes or to other ESTs. Construction of a non-redundant list of genes 
represented in the EST set is done with the help of Unigene, Entrez and PubMed at the National 
25 Center for Biotechnology Information (NCBI) web site at www.ncbi.nlm.nih.gov. 

Genes are identified from ESTs according to known methods. To identify novel genes 
from an EST sequence, the EST should preferably be at least 100 nucleotides in length, and more 
preferably 150 nucleotides in length, for annotation. Preferably, the EST exhibits open reading 
frame characteristics (i.e., can encode a putative polypeptide). 
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Because of the completion of the Human Genome Project, a specific EST which matches 
with a genomic sequence can be mapped onto a specific chromosome based on the chromosomal 
location of the genomic sequence. However, no function may be known for the protein encoded 
by the sequence and the EST would then be considered "novel" in a functional sense. In one 
5 aspect, the invention is used to identify a novel differentially expressed EST, which is part of a 
larger known sequence for which no function is known, is used to determine the function of a 
gene comprising the EST. Alternatively, or additionally, the EST can be used to identify an 
mRNA or polypeptide encoded by the larger sequence as a diagnostic or prognostic marker of a 
disease. 

Having identified an EST corresponding to a larger sequence, other portions of the larger 
sequence which comprises the EST can be used in assays to elucidate gene function, e.g., to 
isolate polypeptides encoded by the gene, to generate antibodies specifically reactive with these 
polypeptides, to identify binding partners of the polypeptides (receptors, ligands, agonists, 
antagonists and the like) and/or to detect the expression of the gene (or lack thereof) in healthy or 
diseased individuals. 

In another aspect, the invention provides for nucleic acid sequences that do not 
demonstrate a "significant match" to any of the publicly known sequences in sequence databases 
at the time a query is done. Longer genomic segments comprising these types of novel EST 
sequences can be identified by probing genomic libraries, while longer expressed sequences can 
20 be identified in cDNA libraries and/or by performing polymerase extension reactions (e.g., 
RACE) using EST sequences to derive primer sequences as is known in the art. Longer 
fragments can be mapped to particular chromosomes by FISH and other techniques and their 
sequences compared to known sequences in genomic and/or expressed sequence databases. 

The amino acid sequences encoded by the ESTs can also be used to search databases, 
25 such as GenBank, SWISS-PROT, EMBL database, PIR protein database, Vecbase, or GenPept 
for the amino acid sequences of the corresponding full-length genes according to procedures well 
known in the art. 

Identified genes can be catalogued according to their putative function. Functional 
characterization of ESTs with known gene matches is preferably made according to the 
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categories described by Hwang et al Compendium of Cardiovascular Genes. Circulation 
1997;96:4146-203. The distribution of genes in each of the subcellular categories will provide 
important insights into the disease process. 

Alternative methods for analysing ESTs are also available. For example, the ESTs may 
5 be assembled into contigs with sequence alignment, editing, and assembly programs such as 
PHRED and PHRAP (Ewing, et aL, 1998, Genome Res. 3:175, incorporated herein; and the web 
site at bozeman.genome.washington.edu). Contig redundancy is reduced by clustering 
nonoverlapping sequence contigs using the EST clone identification number, which is common 
for the nonoverlapping 5 and 3 sequence reads for a single EST cDNA clone. In one aspect, the 
10 consensus sequence from each cluster is compared to the non-redundant Genbank/EMBL/DDBJ 
and dbEST databases using the BLAST algorithm with the help of unigene, Entrez and PubMed 
at the NCBI site. 

Use of Expression Profiles to Predict Disease State 

15 One can also utilize sets of genes which have been identified as statistically significant as 

described above in order to predict whether an asymptomatic individual will develop symptoms 
of said disease or whether an individual with an early stage of said disease will develop a later 
stage of said disease. 

For example, as a result of analyzing over 780 individuals, we have surprisingly shown 
20 that almost all individuals in the 56 and over age group have either moderate, marked or severe 
OA, and furthermore that almost all individuals in the 61 and over age group have either marked 
or severe OA only (see Figure 35 and Table 3AE) even though there remain approximately 50% 
of Canadians over the age of 65 who do not show symptoms of osteoarthritis (Statistics Canada, 
Canadian Community Health Survey, 2000/2001). This data indicates that individuals with mild 
25 OA have a significantly increased chance of progressing to marked or severe OA as compared 
with individuals who do not have mild OA. 
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Table 3AE 



Age Distribution 



OA 


Sex 


<= 


21- 


26- 


31- 


36- 


41- 


46- 


51- 


56- 


61- 


66- 


71- 








20 


25 


30 


35 


40 


45 


50 


55 


60 


65 


70 


86 


Total 


Mild 


F 


5 


8 


17 


12 


9 


3 


0 


1 


0 


0 


0 


0 


55 


M 


5 


12 


14 


16 


13 


11 


4 


3 


0 


2 


0 


0 


80 


1-6 


Total 


10 


20 


31 


28 


22 


14 


4 


4 


0 


2 


0 


0 


135 | 




F 


1 


2 


12 


5 


8 


9 


11 


7 


2 


1 


2 


0 


60 


Moderate 


M 


4 


6 


7 


10 


18 


16 


17 


8 


5 


1 


0 


0 


92 


7-12 


Total 


5 


8 


19 


15 


26 


25 


28 


15 


7 


2 


2 


0 


152 i 




F 


0 


0 


1 


4 


4 


18 


21 


26 


26 


21 


14 


22 


157 


Marked 


M 


1 


0 


3 


7 


11 


10 


27 


28 


14 


10 


8 


18 


137 


13-18 


Total 


1 


0 


4 


11 


15 


28 


48 


54 


40 


31 


22 


40 


r — — — i 

■ 294 • 




F 


0 


0 


1 


0 


0 


1 


4 


9 


10 


6 


10 


25 


66 


Severe 


M 


0 


0 


0 


0 


0 


2 


1 


2 


9 


6 


10 


27 


57 


over 1 9 


Total 


0 


0 


1 


0 


0 


3 


5 


11 


19 


12 


20 


52 


1231 




Total 


16 


28 


55 


54 


63 


70 


85 


84 


66 


47 


44 


92 






F 


4 


5 


8 


4 


8 


4 


3 


1 


0 


0 


0 


0 


37 


Normal 


M 


0 


9 


8 


8 


3 


5 


3 


6 


0 


0 


0 


0 


42 




Total 


4 


14 


16 


12 


11 


9 


6 


7 


0 


0 


0 


0 


1 "79f! 



As a result, one can utilize the methods of class prediction analysis described herein in 
5 order to determine whether an individual will develop late stage OA by identifying individuals 
with early stages of OA. 

As additional samples are obtained, for example during clinical trials, their expression 
profiles can be determined and correlated with the relevant subject data in the database and 
likewise be recorded in said database. Algorithms as described above can be used to query 
10 additional samples against the existing database to further refine the predictive determination by 
allowing an even greater association between the prediction of OA and gene expression 
signature. 

The prediction of late stage OA may thus be performed by detecting the expression level 
of two or more genes, three or more genes, four or more genes, five or more genes, six or more 
15 genes, seven or more genes, eight or more genes, nine or more genes, ten or more genes, fifteen 
or more genes, twenty or more genes thirty or more genes, fifty or more genes, one hundred or 
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more genes, two hundred or more genes, three hundred or more genes, five hundred or more 
genes or all of the genes disclosed for identifying mild OA. 

Known Nucleic acid Sequences or ESTs and Novel Nucleic acid Sequences or ESTs 

5 An EST that exhibits a significant match (> 65%, and preferably 90% or greater, identity) 

to at least one existing sequence in an existing nucleic acid sequence database is characterised as 
a "known" sequence according to the invention. Within this category, some known ESTs match 
to existing sequences which encode polypeptides with known function(s) and are referred to as a 
"known sequence with a function". Other "known" ESTs exhibit a significant match to existing 
10 sequences which encode polypeptides of unknown function(s) and are referred to as a "known 
sequence with no known function". 

EST sequences which have no significant match (less than 65% identity) to any existing 
sequence in the above cited available databases are categorised as novel ESTs. To identify a 
novel gene from an EST sequence, the EST is preferably at least 150 nucleotides in length. 
15 More preferably, the EST encodes at least part of an open reading frame, that is, a nucleic acid 
sequence between a translation initiation codon and a termination codon, which is potentially 
translated into a polypeptide sequence. 

The following references were cited herein: 
20 Claudio JO et al. (1998). Genomics 50:44-52. 

Chelly J et al. (1989). Proc. Nat. Acad ScL USA. 86:2617-2621. 

Chelly J et al (1988). Nature 333:858-860. 

Drews J & Ryser S (1997). Nature Biotech. 15:1318-9. 

Ferrie RM a/. (1992). Am. J. Hum. Genet. 51:251-62. 
25 Fu D-J et al. (1998). Nat. Biotech 16: 381-4. 

Gala JL etal. (1998). Clin. Chem. 44(3):472-81. 

Geisterfer-Lowrance AAT et al (1990). Cell 62:999-1006. 

Groden J et al. (1991). Cell 66:589-600. 

Hwang DM et al. (1997). Circulation 96:4146-4203. 
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Jandreski MA & Liew CC (1987). Hum. Genet. 76:47-53. 

Jin O et al. (1990). Circulation 82:8-16 

Kimoto Y (1998). Mol. Gen. Genet 258:233-239. 

KosterMef a/. (1996). Nat. Biotech 14: 1123-8. 
5 Liew & Jandreski (1 986). Proc. Nat. Acad. Sci. USA. 83:3 1 75-3 179 

Liew CC et al. (1990). Nucleic Acids Res. 18:3647-3651. 

Liew CC (1993). J Mol. Cell. Cardiol. 25:891-894 

Liew CC et al. (1994). Proc. Natl. Acad. Sci. USA. 91:10645-10649. 

Liew et al. (1997). Mol. and Cell. Biochem. 172:81-87. 
10 Niimura H et al. (1998). New Eng. J. Med. 338:1248-1257. 

Ogawa M (1993). Blood 81:2844-2853. 

Santoro IM & Groden J (1997). Cancer Res. 57:488-494. 

Yuasa T et al. (1998). Japanese J. Cancer Res. 89:879-882. 
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Description of Tables : 

Table 1 : Overlap of Genes Expressed in Blood 

(Estimated from about 5,100 unique known genes from the over 25,000 ESTs obtained from 
human blood cDNA libraries). 

5 Table 2: Comparison of approximately 5,140 Unique Genes Identified in the Blood 
Cell cDNA Library to Genes Previously Identified in Specific Tissues 
Column 1 : List of unique genes derived from 25,000 known ESTs from blood cells. 
Column 2: Number of genes found in randomly sequenced ESTs from blood cells. 
Column 3: Accession number. 

10 Column 4: indicates the presence of the unique gene in publicly available cDNA libraries 
of blood (Bl), brain (Br), heart (H), kidney (K), liver (Li) and lung (Lu). **Comparison to 
previously identified tissue-specific genes was determined using the GenBank of the National 
Centre of Biotechnology Information (NCBI) Database. 



15 Table 3 shows genes that are differentially expressed in blood samples from patients with 
different diseases as compared to blood samples from healthy patients. 

Table 3 A shows the identity of those genes that are differentially expressed in blood samples 
from patients with osteoarthritis and hypertension as compared with normal patients as 
depicted in Figure 8 

20 Table 3B shows the identity of those genes that are differentially expressed in blood samples 
from patients with osteoarthritis and obesity as compared with normal patients as depicted in 
Figure 9. 

Table 3C shows the identity of those genes that are differentially expressed in blood samples 
from patients with osteoarthritis and allergies as compared with normal patients as depicted in 
25 Figure 10. 
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Table 3D shows the identity of those genes that are differentially expressed in blood samples 
from patients with osteoarthritis and subject to systemic steroids as compared with normal 
patients as depicted in Figure 1 1 . 

Table 3E shows the identity of those genes that are differentially expressed in blood samples 
5 from patients with hypertension as depicted in Figure 12. 

Table 3F shows the identity of those genes that are differentially expressed in blood samples 
from patients obesity as depicted in Figure 13. 

Table 3G shows the identity of those genes that are differentially expressed in blood samples 
from patients with type II diabetes as depicted in Figure 14. 

10 Table 3H shows the identity of those genes that are differentially expressed in blood samples 
from patients with hyperlipidemia as depicted in Figure 15. 

Table 31 shows the identity of those genes that are differentially expressed in blood samples 
from patients with lung disease as depicted in Figure 16. 

Table 3J shows the identity of those genes that are differentially expressed in blood samples 
15 from patients with bladder cancer as depicted in Figure 17. 

Table 3K shows the identity of those genes that are differentially expressed in blood samples 
from patients with bladder cancer as depicted in Figure 18. 

Table 3L shows the identity of those genes that are differentially expressed in blood samples 
from patients with coronary artery disease (CAD) as depicted in Figure 19. 

20 Table 3M shows the identity of those genes that are differentially expressed in blood samples 
from patients with rheumatoid arthritis as depicted in Figure 20. 

Table 3N shows the identity of those genes that are differentially expressed in blood samples 
from patients with depression as depicted in Figure 21. 

Table 30 shows the identity of those genes that are differentially expressed in blood samples 
25 from patients with various stages of osteoarthritis as depicted in Figure 22. 
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Table 3P shows the identity of those genes that are differentially expressed in blood samples 
from patients with hypertension and OA when compared with patients who have OA only 
wherein genes identified in Table 3 A have been removed so as to identify genes which are 
unique to hypertension. 

5 Table 3Q shows the identity of those genes which were identified in Table 3 A which are shared 
with those genes differentially expressed in blood samples from patients with hypertension and 
OA when compared with patients who have OA only. 

Table 3R shows the identity of those genes that are differentially expressed in blood samples 
from patients who are obese and have OA when compared with patients who have OA only and 
10 wherein genes identified in Table 3B have been removed so as to identify genes which are 
unique to obesity. 

Table 3S shows the identify of those genes identified in Table 3B which are shared with those 
genes differentially expressed in blood samples from patients who are obese and have OA when 
compared with patients who have OA. 

15 Table 3T shows the identity of those genes that are differentially expressed in blood samples 
from patients with allergies and OA when compared with patients who have OA only wherein 
genes identified in Table 3C have been removed so as to identify genes which are unique to 
allergies. 

Table 3U shows the identify of those genes identified in Table 3C which are shared with those 
20 genes differentially expressed in blood samples from patients with allergies and OA when 
compared with patients who have OA only. 

Table 3V shows the identity of those genes that are differentially expressed in blood samples 
from patients who are on systemic steroids and have OA when compared with patients who have 
OA only wherein genes identified in Table 3D have been removed so as to identify genes which 
25 are unique to patients on systemic steroids. 
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Table 3W shows the identify of those genes identified in Table 3D which are shared with those 
genes differentially expressed in blood samples from patients who are on systemic steroids and 
have OA when compared with patients who have OA only. 

Table 3X shows the identity of those genes that are differentially expressed in blood samples 
5 from patients with liver cancer as depicted in Figure 25. 

Table 3Y shows the identity of those genes that are differentially expressed in blood samples 
from patients with schizophrenia as depicted in Figure 26. 

Table 3Z shows the identity of those genes that are differentially expressed in blood samples 
from patients with Chagas disease as depicted in Figure 27. 

10 Table 3AA shows the identity of those genes that are differentially expressed in blood samples 
from patients with asthma as depicted in Figure 28. 

Table 3AB shows the identity of those genes that are differentially expressed in blood from 
patients with either mild or severe OA, but for which genes relevant to asthma, obesity, 
hypertension, systemic steroids and allergies have been removed. 

15 Table 3 AC shows the identity of those genes that are differentially expressed in blood from 
patients with schizophrenia as compared with manic depression syndrome (MDS). 

Table 3AD shows the identity of those genes that are differentially expressed in blood from 
patients taking either birth control, prednisone or hormone replacement therapy and presenting 
with OA as depicted in Figure 34. 

20 Table 3AE shows how the incidence of different stages of OA varies with respect to age in 
males and females 

Table 4 shows 102 EST sequences of Tables 3A-3AD with "no-significant match" to known 
gene sequences. 

Table 5 shows a list of genes showing greater than two fold differential expression in CAD 
25 peripheral blood cells vs. normal blood cells. 
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The following examples are given for the purpose of illustrating various embodiments 
of the invention and are not meant to limit the present invention in any fashion. 

EXAMPLE 1 

5 Construction of a cDNA library 

RNA extracted from human tissues (including fetal heart, adult heart, liver, brain, 
prostate gland and whole blood) were used to construct unidirectional cDNA libraries. The 
first mammalian heart cDNA library was constructed as early as 1982. Since then, the 
methodology has been revised and optimal conditions have been developed for construction of 
10 human heart and hematopoietic progenitor cDNA libraries (Liew et al. 9 1984; Liew 1993, 
Claudio et al. 9 1998). Most of the novel genes which were identified by sequence annotation 
can now be obtained as full length transcripts. 



EXAMPLE 2 

1 5 Catalogue of EST database 

Random partial sequencing of expressed sequence tags (ESTs) of cDNA clones from 
the blood cell library was carried out to establish an EST database of blood. The known genes 
as derived from the ESTs were categorized into seven major cellular functions (Hwang, 
Dempsey et al, 1997). The preparation of the chondrocyte-specific EST database is reported in 
20 WO 02/070737, which is hereby incorporated by reference in its entirety. 



EXAMPLE 3 

Differential screening of cDNA library 
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cDNA probes generated from transcripts of each tissue were used to hybridize the 
blood cell cDNA clones or chondrocyte cDNA clones (Liew et aL, 1997; WO 02/070737). 
The "positive" signals which were hybridized with P-labeled cDNA probes were defined as 
genes which shared identity with blood and respective tissues. The "negative" spots which 
were not exposed to P-labeled cDNA probes were considered to be blood-cell-enriched or low 
frequency transcripts. 

EXAMPLE 4 

Reverse transcriptase-polymerase chain reaction (RT-PCR) assay 

RNA extracted from samples of human tissue was used for RT-PCR analysis (Jin et al. 
1990). Three pairs of forward and reverse primers were designed for human cardiac beta- 
myosin heavy chain gene (pMyHC), amyloid precursor protein (APP) gene and adenomatous 
polyposis-coli protein (APC) gene. The PCR products were also subjected to automated DNA 
sequencing to verify the sequences as derived from the specific transcripts of blood. 

EXAMPLE 5 

Detection of tissue specific gene expression in human blood using RT-PCR 

The beta-myosin heavy chain gene (pMyHC) transcript (mRNA) is known to be highly 
expressed in ventricles of the human heart. This sarcomeric protein is important for heart 
muscle contraction and its presence would not be expected in other non-muscle tissues and 
blood. In 1 990, the gene for human cardiac PMyHC was completely sequenced (Liew et al. 
1990) and was comprised of 41 exons and 42 introns. 

The method of reverse transcription polymerase chain reaction (RT-PCR) was used to 
determine whether this cardiac specific mRNA is also present in human blood. A pair of 
primers was designed; the forward primer (SEQ ID No. 3) was on the boundary of exons 21 
and 22, and the reverse primer (SEQ ID No. 4) was on the boundary of exons 24 and 25. This 
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region of mRNA is only present in pMyHC and is not found in the alpha-myosin heavy chain 
gene (aMyHC). 

A blood sample was first treated with lysing buffer and then undergone centrifuge. The 
resulting pellets were further processed with RT-PCR. RT-PCR was performed using the total 
5 blood cell RNA as a template. A nested PCR product was generated and used for sequencing. 
The sequencing results were subjected to BLAST and the identity of exons 21 to 25 was 
confirmed to be from (3MyHC (Figure 1 A). 

Using the same method just described, two other tissue specific genes - amyloid 
precursor protein (APP, forward primer, SEQ ED No. 7; reverse primer, SEQ ID No. 8) found 
10 in the brain and associated with Alzheimer's disease, and adenomatous polyposis coli protein 
(APC) found in the colon and rectum and associated with colorectal cancer (Groden et al 
1991; Santoro and Groden 1997) - were also detected in the RNA extracted from human blood 
(Figure IB). 

15 EXAMPLE 6 

Multiple RT-PCR analysis on a drop of blood from a normal/diseased individual 

A drop of blood was extracted to obtain RNA to carry out quantitative RT-PCR 
analysis. Specific primers for the insulin gene were designed: forward primer (5'- 
GCCCTCTGGGGACCTGAC-3 ' , SEQ ID NO 1) of exon 1 and reverse primer (5'- 

20 CCCACCTGCAGGTCCTCT-3", SEQ ID NO 2) of exons 1 and 2 of insulin gene. Such 

reverse primer was obtained by deleting the intron between the exons 1 and 2. Blood samples 
of 4 normal subjects were assayed. It was found that the insulin gene is expressed in the blood 
and the quantitative expression of the insulin gene in a drop of blood is influenced by fasting 
and non-fasting states of normal healthy subjects (Figure 2). This very low level of expression 

25 of the insulin gene reflects the phenotypic status of a person and strongly suggests that there is 
a physiological and pathological role for its expression, contrary to the basal or illegitimate 
theory of transcription suggested by Chelly et al (1989) and Kimoto (1998). 
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Same quantitative RT-PCR analysis was performed using insulin specific primers on 
RNA samples extracted from a drop of blood from a normal healthy person, a person having 
late-onset diabetes (Type II) and a person having asymptomatic diabetes. It was found that the 
insulin gene is expressed differentially amongst subjects that are healthy, diagnosed as type II 
5 diabetic, and also in an asymptomatic preclinical patient (Figure 3). 

Similarly, specific primers for the atrial natriuretic factor (ANF) gene were designed 
(forward primer, SEQ ED No. 5; reverse primer, SEQ ID No. 6) and RT-PCR analysis was 
performed on a drop of blood. ANF is known to be highly expressed in heart tissue biopsies 
and in the plasma of heart failure patients. However, atrial natriuretic factor was observed to 
10 be expressed in the blood and the expression of the atrial natriuretic factor gene is significantly 
higher in the blood of patients with heart failure as compared to the blood of a normal control 
patient. 

Specific primers for the zinc finger protein gene (ZFP, forward primer, SEQ ID No. 9; 
reverse primer, SEQ ID No. 10) were also designed and RT-PCR analysis was performed on a 

15 drop of blood. ZFP is known to be high in heart tissue biopsies of cardiac hypertrophy and 
heart failure patients. In the present study, the expression of ZFP was observed in the blood as 
well as differential expression levels of ZFP amongst the normal, diabetic and asymptomatic 
preclinical subjects (Figure 4); although neither of the non-normal subjects has been 
specifically diagnosed as suffering from cardiac hypertrophy and/or heart failure, the higher 

20 expression levels of the ZFP gene in their blood may indicate that these subjects are headed in 
that general direction. 

It was hypothesized that a housekeeping gene such as glyceraldehyde dehydrogenase 
(GADH) which is required and highly expressed in all cells would not be differentially 
expressed in the blood of normal vs. disease subjects. This hypothesis was confirmed by RT- 
25 PCR using GADH specific primers (Figure 4). Thus, GADH is useful as an internal control. 

Standardized levels of insulin gene or ZFP gene expressed in a drop of blood were 
estimated using a housekeeping gene as an internal control relative to insulin or ZFP expressed 
(Figures 5 A & 5B). The levels of insulin gene expressed in each fractionated cell from whole 
blood were also standardized and shown in Figure 5C. 
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EXAMPLE 7 

Human blood cell cDNA library 

5 In order to further substantiate the present invention, differential screening of the 

human blood cell cDNA library was conducted. cDNA probes derived from human blood, 
adult heart or brain were respectively hybridized to the human blood cDNA library clones. As 
shown in Figure 7, more than 95% of the "positively" identified clones are identical between 
the blood and other tissue samples. 

10 DNA sequencing of randomly selected clones from the human whole blood cell cDNA 

library was also performed. This allowed information regarding the cellular function of blood 
to be obtained concurrently with gene identification. More than 20,000 expressed sequence 
tags (ESTs) have been generated and characterized to date, 17.6% of which did not result in a 
statistically significant match to entries in the GenBank databases and thus were designated as 

15 "Novel" ESTs. These results are summarized in Figure 7 together with the seven cellular 
functions related to percent distribution of known genes in blood and in the fetal heart. 

. From 20,000 ESTs, 1,800 have been identified as known genes which may not all 
appear in the hemapoietic system. For example, the insulin gene and the atrial natriuretic 
factor gene have not been detected in these 20,000 ESTs but their transcripts were detected in a 
20 drop of blood, strongly suggesting that all transcripts of the human genome can be detected by 
performing RT-PCR analysis on a drop of blood. 

In addition, approximately 400 novel genes have been identified from the 20,000 ESTs 
characterized to date, and these will be subjected to full length sequencing and open reading 
frame alignment to reduce the actual number of novel ESTs prior to screening for disease 
25 markers. 

Analysis of the approximately 6,283 ESTs which have known matches in the GenBank 
databases revealed that this dataset represents over 1,800 unique genes. These genes have been 
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catalogued into seven cellular functions. Comparisons of this set of unique genes with ESTs 
derived from human brain, heart, lung and kidney demonstrated a greater than 50% overlap in 
expression (Table 1). 

TABLE 1 



Tissue 


UniGene* 


Overlap 


Brain 


19,158 


70% 


Heart 


17,021 


67% 


Kidney 


19,414 


69% 


Liver 


22,836 


71% 


Lung 


22,209 


75% 



* Known gene cluster numbers found in a corresponding tissue in UniGene. 

There are about 5,100 unique known genes from the over 25,000 ESTs obtained from 
human blood cDNA libraries. These genes were searched against human UniGene, Build #160 
(with a total of 1 1 1 ,064 clusters). 



EXAMPLE 8 
Blood cell ESTs 

The results from the differential screening clearly indicate that the transcripts expressed 
in the whole blood are reflective of genes expressed in all cells and tissues of the body. More 
than 95% of detectable spots were identical from two different tissues. The remaining 5% of 
spots may represent cell- or tissue-specific transcripts; however, results obtained from partial 
sequencing to generate ESTs of these clones revealed most of them not to be cell- or tissue- 
specific transcripts. Therefore, the negative spots are postulated to be reflective of low 
abundance transcripts in the tissue from which the cDNA probes were derived. 

An alternative approach that was employed to identify transcripts expressed at low 
levels is the large-scale generation of expressed sequence tags (ESTs). There is substantial 
evidence regarding the efficiency of this technology to detect previously characterized (known) 
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and uncharacterized (unknown or novel) genes expressed in the cardiovascular system (Hwang 
& Dempsey et al. 1997). In the present invention, 20,000 ESTs have been produced from a 
human blood cell cDNA library and resulted in the identification of approximately 1,800 
unique known genes (Table 2) 

5 In the most recent GenBank release, analysis of more than 300,000 ESTs in the 

database (dbESTs) generated more than 48,000 gene clusters which are thought to represent 
approximately 50% of the genes in the human genome. Only 4,800 of the dbESTs are blood- 
derived. In the present invention, 20,000 ESTs have been obtained to date from a human blood 
cDNA library, which provides the world's most informative database with respect to blood cell 

10 transcripts. From the limited amount of information generated so far (i.e. 1,800 unique genes), 
it has already been determined that more than 50% of the transcripts are found in other cells or 
tissues of the human body (Table 2). Thus, it is expected that by increasing the number of 
ESTs generated, more genes will be identified that have an overlap in expression between the 
blood and other tissues. Furthermore, the transcripts for several genes which are known to 

15 have tissue-restricted patterns of expression (i.e. (iMyHC, APP, APC, ANF, ZFP) have also 
been demonstrated to be present in blood. 

Most recently, a cDNA library of human hematopoietic progenitor stem cells has also 
been constructed. From the limited set of 1,000 ESTs, there are at least 200 known genes that 
are shared with other tissue related genes (Claudio et al 1998). 

20 Table 2 demonstrates the expression of known genes of specific tissues in blood cells. 

Previously, only the presence of "housekeeping" genes would have been expected. 
Additionally, the presence of at least 25 of the currently known 500 genes corresponding to 
molecular drug targets was detected. These molecular drug targets are used in the treatment of 
a variety of diseases which involve inflammation, renal and cardiovascular function, neoplastic 

25 disease, immunomodulation and viral infection (Drews & Ryser, 1997). It is expected that 
additional novel ESTs will represent future molecular drug targets. 
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EXAMPLE 9 

Blood cDNA chip Microarray Data Analysis of gene expression profiles of blood 
samples from individuals having coronary artery disease as compared with gene expression 
5 profiles from normal individuals. 

A microarray was constructed using cDNA clones from a human peripheral blood cell 
cDNA library, as described herein. A total of 10,368 polymerase chain reaction (PCR) 
products of the clones from the human peripheral blood cell cDNA library described herein 
were arrayed using GNS 417 arrayer (Affymetrix). RNA for microarray analysis was isolated 
10 from whole blood samples obtained from three male and one female patients with coronary 
heart disease (80 - 90% stenosis) receiving vascular extension drugs and awaiting bypass 
surgery, and three healthy male controls. 

A method of high-fidelity mRNA amplification from Ipg of total RNA sample was 
used. Cy5- or Cy3-dUTP was incorporated into cDNA probes by reverse transcription of anti- 

15 sense RNA, primed by oligo-dT. Labeled probes were purified and concentrated to the desired 
volume. Pre- hybridization and hybridization were performed following Hegde's protocol 
(Hegde P et al., A concise guide to cDNA microarray analysis. Biotechniques 2000;29: 548 - 
56). After overnight hybridization and washing, hybridization signals were detected with a 
GMS 418 scanner at 635-nm (Cy5) and 532-nm (Cy3) wave lengths (see Figure 24). Two 

20 RNA pools were labeled alternatively with Cy5- and Cy3-dUTP, and each experiment was 
repeated twice. Cluster analysis using GeneSpring™ 4.1 .5 (Silicon Genetics) revealed two 
distinct groups consisting of four CAD and three normal control samples. Two images 
scanned at different wavelengths were super- imposed. Individual spots were identified on a 
customized grid. Of 10,368 spots, 10,012 (96.6%) were selected after the removal of spots 

25 with irregular shapes. Data quality was assessed with values of ChlGTB2 and Ch2GTB2 
provided by ScanAlyze. Only spots with CWGTB2 and Ch2GTB2 over 0.50 were selected. 
After evaluation of signal intensities, 8750 (84.4%) spots were left. Signal intensities were 
normalized using a scatter-plot of the signal intensities of the two channels. After 
normalization, the expression ratios of p-actin were 1.00 + 021 1 11+ 0.22, 1.14 + 0.20 and 
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1 .30 + 0. 1 8 (24 samples of p-actin were spotted on this slide as the positive control) in the four 
images. Gene differential expression was assessed as the ratio of two wave-length signal 
intensities. Spots showing a differential expression more than twofold in all four experiments 
were identified as peripheral blood cell, differentially expressed candidate genes in CAD. 108 
5 genes are differentially expressed in CAD peripheral blood cells. 43 genes are downregulated 
in CAD blood cells and 65 are upregulated (see Table 5). Functional characterization of these 
genes shows that differential expression takes place in every gene functional category, 
indicating that profound changes occur in CAD blood cells. 

The differential expression of three genes, pro-platelet basic protein (PBP), platelet 
10 factor 4 (PF4) and coagulation factor XIII Al (F13A), initially identified in the microarray data 
analysis, was further examined by reverse transcriptase-PCR (RT-PCR) using the Titan One- 
tube RT-PCR kit (Boehringer Mannheim). Reaction solution contains 0.2 mM each dNTP, 5 
mM DTT, 1.5 mM MgCl 0.1 pg of total RNA from each sample and 20 pmol each of left and 
right primers of PBP (5'- GGTGCTGCTGCTTCTGTCAT-3 ? and 5 -GGC AG ATTTT 
15 CCTCCCATCC-3 1 ), F13A (S'-AGTCCACCGTGCTAACCATC-S' and 5'- 

AGGGAGTCACTGCTCATGCT-3') and PF4 (5' GTTGCTGCTCCTGCCACTT 3' and 5' 
GTGGCTATCAGTTGGGCAGT-3'). RT-PCR steps are as follows: 1. reverse-transcription: 
30 min at 60 °C; 2. PCR: 2 min at 94 °C, followed by 30 - 35 cycles (as optimized for each 
gene) for 30 s at 94 °C, 30 s at optimized annealing temperature and 2 min at 68 °C; 3. final 
20 extension: 7 min at 68 °C. PCR products were electrophoresed on 1.5% agarose gels. Human 
(P-actin primers (5 -GCGAGAAGATGACCCAGATCAT-3' and 5'- 
GCTCAGGAGGAGCAATGATCTT-3 1 ) were used as the internal control. The RT-PCR 
analysis confirmed that the expression of the three secreted proteins: PBP, PF4 and F13A were 
all upregulated in CAD blood cells (see Figure 23). 
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TABLE 5 



Accession 
number 



Upregulated gene in 
CAD 

REV3-like, catalytic 
subunit of DNA 
polymerase zeta 
TGFB1 -induced anti- 
apoptotic factor 1 
A disintegrin and 
metalloproteinase domain 
10 

Centaurin, delta 2 
Chloride intracellular 
channel 4 

Endothelin receptor typeA 
Glutamate receptor, 
ionotropic 

Mitogen-activated protein 
kinase 7 

Mitogen-activated protein 
kinase kinase kinase 7 
Myristoylated alanine-rich 
protein kinase C substrate 
NIMA-related kinase 7 
PAK2 

Phospholipid scramblase 
1 

Serum deprivation 

response 

Adducin 3 

Desmin 

Fibromodulin 

Laminin, beta 2 

Laminin, beta 3 

Osteonectin 

CD59 antigen p18-20 

Clusterin 

F13A ' 



Fold 
(avera 

ge) 



Functional 
category 



Protein 
Accession 
Number 



AF035537 


2.3 


Cell cycle 


NP_ 


002903 


D86970 


2.2 


Cell cycle 


NP_ 


.510880 


AA044656 


2.7 


Cell signaling 


NP_ 


.001101 


AA351412 


2 


Cell signaling 


NP 


631920 


AA411940 


2.2 


Cell signaling 


NP_ 


.039234 


D90348 


2.1 


Cell signaling 


NP 


001948 


N33821 


2.4 


Cell signaling 


NP_ 


.777567 


L38486 


. 3.7 


CpII ^innalinn 


NP. 


.002395 


AB009356 


4 5 


CpII ^innpilinn 


NP_ 


.663306 


D10522 


2 5 


CpII ^innalinn 


NP_ 


.002347 


AA093324 


3.5 


Cell sianalina 

v*/ 1 1 oiy i lain iu 


NP 


.598001 


AA262968 


3.5 


Cell sianalina 


Q13177 


AA054476 


3 3 


CpII sianalina 

V— / w 1 1 OIUI IQIII |U 


NP_ 


.066928 


Z30112 

i— W \J 1 1 t— 


4 5 


CpII Qinn^linn 
veil oiyiidiiiiy 


NP. 


.004648 


AA029158 


2.9 


Cell structure 


NP 


063968 


AF 167579 


4.4 


Cell structure 


NP 


001918 


W23613 


2.9 


Cell structure 


NP 


002014 


S77512 


2.2 


Cell structure 


NP 


002283 


L25541 


2.4 


Cell structure 


NP 


000219 


Y00755 


3.1 


Cell structure 


NP 


003109 


W01111 


2.4 


Cell/organism 


NP. 


_000602 






defense 






M64722 


3.5 


Cell/organism 


NP. 


.001822 






defense 






M 14539 


2.1 


Cell/organism 


NP. 


_000120 






defense 
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Defensin, alpha 1 


M26602 


4.2 


Cell/organism 
defense 


NP_ 


004075 


PF4 


M25897 


2.1 


Cell/organism 
defense 


NP_ 


.002610 


PBP 


M 54995 


5.5 


Cell/organism 
defense 


NP_ 


.002695 


E2F transcription factor 3 


D38550 


2.1 


Gene 
expression 


NP_ 


.001940 


Early growth response 1 


M62829 


2.7 


Gene 
expression 


NP_ 


.001955 


Eukaryotic translation 


N86030 


2.3 


Gene 


NP_ 


.001393 


elongation factor 1 alpha 1 






expression 






Eukaryotic translation 


M 15353 


2.1 


Gene 


NP_ 


.001959 


initiation factor 4E 






expression 






F-box and WD-40 domain 


AB014596 


2.7 


Gene 


NP_ 


.387449 


protein 1 B 






expression 






Makorin, ring finger 


AA331966 


2.1 


Gene 


NP_ 


054879 


protein, 2 






expression 






Non-canonical ubiquitin- 


N92776 


2.5 


Gene 


NP_ 


.057420 


conjugating enzyme 1 






expression 






Nuclear receptor subfamily 


Z30425 , 


4.7 


Gene 


NP_ 


.005113 


1, group I, member 3 






expression 






Ring finger protein 1 1 


T08927 


3 


Gene 
expression 


NP_ 


.055187 


Transducin-like enhancer 


M99435 


3.3 


Gene 


NP. 


_005068 


of split 1 






expression 






Alkaline phosphatase, 


ABO 11406 


2.2 


Metabolism 


NP_ 


_000469 


liver/bone/kidney 












Annexin A3 


M63310 


3.4 


Metabolism 


NP 


005130 


Branched chain 


AA336265 


4.8 


Metabolism 


NP_ 


005495. 


aminotransferase 1 , 










1 


cytosolic 












Cytochrome b 


AF042500 


2.5 


Metabolism 






Glutaminase 


D30931 


2.6 


Metabolism 


NP 


055720 


Lysophospholipase I 


AF035293 


2.8 


Metabolism 


NP 


006321 


NADH dehydrogenase 1, 


AA056111 


2.5 


Metabolism 


NP. 


_002485 


subcomplex unknown 1 , 6 












kDa 












Phosphofructokinase 


M26066 


2.2 


Metabolism 


NP 


000280 


Ubiquinol-cytochrome c 


M22348 


2.5 


Metabolism 


NP. 


_006285 


reductase binding protein 












CGI-1 10 protein 


AA341061 


2.4 


Unclassified 


NP 


057131 


Dactylidin 


H95397 


2.7 


Unclassified 


NP 


112225 


Deleted in split-hand/split- 


T24503 


2.4 


Unclassified 


NP. 


,006295 


foot 1 region 












Follistatin-like 1 


R14219 


2.7 


Unclassified 


NP 


009016 
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FUS-interacting protein 1 
Hypothetical protein 
FLJ12619 

Hypothetical protein from 
EUROIMAGE 588495 
Hypothetical protein 
LOC51315 
KIAA1705 protein 

Mesoderm induction early 
response 1 

Phosphodiesterase 4D- 
interacting protein 
Preimplantation protein 3 
Putative nuclear protein 
ORF1-FL49 
Similar to rat nuclear 
ubiquitous casein kinase 2 
Similar to RIKEN 
Spectrin, beta 
Stromal cell-derived factor 
receptor 1 

Thioredoxin-related 
protein 

Transmembrane 4 
superfamily member 2 
Tumor endothelial marker 
8 

Downregulated gene in 
CAD 

CASP8 and FADD-like 

apoptosis regulator 

CD81 antigen 

Cell division cycle 25B 

DEAD/H (Asp-Glu-Ala- 

Asp/His) box polypeptide 

27 

F-box and leucine-rich 
repeat protein 1 1 
Minichromosome 
maintenance deficient 3 
associated protein 
Protein phosphatase 2, 
regulatory subunit A, alpha 
isoform 

Thyroid autoantigen 70 



W37945 

VVHf ZOO 


2.8 

-7 
I 


Unclassified 
Unclassified 


NP_473357 

k i pi a a oon-i 

I\lr_1 12201 


MftQO/17 

INboZ4/ 


O 7 


Unclassified 




A AOC-1 A OO 


o o 


Unclassified 


Nr_057702 


I oUooy 


O 7 
Z.f 


Unclassified 


NP 009121. 
1 

NP_065999 


AI650409 


2.2 


Unclassified 


AA740661 


2.5 


Unclassified 


NP_055459 


D59087 
W33098 


2.5 
2.8 


Unclassified 
Unclassified 


NP 056202 
NP_1 15788 


HUy4o4 


2.2 


Unclassified 


Q9H1E3 


A A OH7>1 A O 

AAzy f41z 
AI334431 

L17-| CCD 


2.5 
2.5 
4.1 


Unclassified 
Unclassified 
Unclassified 


T02670 
Q01082 
NP 8 16929 


AA421549 


2.8 


Unclassified 


NP_1 10437 


D29808 


2.4 


Unclassified 


NP_004606 


D79964 


2.5 


Unclassified 


Kin A A A O/^O 

NP_444262 


AF015450 


0.45 


Cell cycle 


NP_003870 


M33680 
Moiyo4 
AA985699 


0.41 
0.4 
0.42 


Cell cycle 
Cell cycle 
Cell cycle 


NP_004347 
NP 068660 
NP_694705 


R98291 


0.27 


Cell cycle 


NP_036440 


H 10286 


0.43 


Cell cycle 


NP_003897 


J02902 


0.48 


Cell cycle 


NP_055040 


J04607 


0.25 


Cell cycle 


NP_001460 
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kDa 












A disintegrin and 


R32760 


0.37 


Cell signaling 






metalloproteinase domain 
17 












A kinase anchor protein 13 


M90360 


0.31 


Cell signaling 


NP 


658913 


Calpastatin 


AF037194 


0.39 


Cell signaling 


NP 


006471 


Diacylglycerol kinase, 


AF064770 


0.44 


Cell signaling 


NP_ 


001336 


alpha 80 kDa 










gamma-aminobutyric acid 


AJ012187 


0.42 


Cell signaling 


NP_ 


068705 


B receptor, 1 












Inositol polyphosphate-5- 


U84400 


0.41 


Cell signaling 


NP_ 


005532 


phosphatase, 145 kDa 












Lymphocyte-specific 


X05027 


0.45 


Cell signaling 


NP_ 


.005347 


protein tyrosine kinase 












RAP1 B, member of RAS 


P09526 


0.4 


Cell signaling 


P09526 


oncogene family 












Ras association 


AF061836 


0.43 


Cell signaling 


NP_ 


.733835 


(RalGDS/AF-6) domain 










family 1 












CDC42-effector protein 3 


AF 104857 


0.28 


Cell signaling 


NP 


006440 


Leupaxin 


AF062075 


0.31 


Cell signaling 


NP 


004802 


Annexin A6 


D00510 


0.45 


Cell structure 


NP 


004024 


RAN-binding protein 9 


AB008515 


0.41 


Cell structure 


NP 


005484 
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Transcription factor A, 
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expression 
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protein interacting protein 













EXAMPLE 10 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from co-morbid individuals having osteoarthritis and hypertension as compared with gene 
5 expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with osteoarthritis and hypertension as 
compared to blood samples taken from healthy patients. 

As used herein, the term "hypertension" is defined as high blood pressure or elevated 
10 arterial pressure. Patients identified with hypertension herein include persons who have an 
increased risk of developing a morbid cardiovascular event and/or persons who benefit from 
medical therapy designed to treat hypertension. Patients identified with hypertension also can 
include persons having systolic blood pressure of >130 mm Hg or a diastolic blood pressure of 
>90 mm Hg or a person takes antihypertensive medication. 

15 Osteoarthritis (OA), as used herein also known as "degenerative joint disease", represents 

failure of a diarthrodial (movable, synovial-lined) joint. It is a condition, which affects joint 
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cartilage, and or subsequently underlying bone and supporting tissues leading to pain, stiffness, 
movement problems and activity limitations. It most often affects the hip, knee, foot, and hand, 
but can affect other joints as well. 

OA severity can be graded according to the system described by Marshall (Marshall KW. 

5 J Rheumatol, 1996:23(4) 582-85). Briefly, each of the six knee articular surfaces was assigned a 
cartilage grade with points based on the worst lesion seen on each particular surface. Grade 0 is 
normal (0 points), Grade I cartilage is soft or swollen but the articular surface is intact (1 point). 
In Grade II lesions, the cartilage surface is not intact but the lesion does not extend down to 
subchondral bone (2 points). Grade III damage extends to subchondral bone but the bone is 

10 neither eroded nor eburnated (3 points). In Grade IV lesions, there is eburnation of or erosion 
into bone (4 points). A global OA score is calculated by summing the points from all six 
cartilage surfaces. If there is any associated pathology, such as meniscus tear, an extra point will 
be added to the global score. Based on the total score, each patient is then categorized into one 
of four OA groups: mild (1-6), moderate (7-12), marked (13-18), and severe (>18). As used 

15 herein, patients identified with OA may be categorized in any of the four OA groupings as 
described above. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
hypertension as defined herein. Gene expression profiles were then analysed and compared to 
profiles from patients unaffected by any disease. In each case, the diagnosis of osteoarthritis and 
20 hypertension was corroborated by a skilled Board certified physician . 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
25 expressed in blood samples from patients with disease as compared to healthy patients was 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 
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Figure 8 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals having hypertension and osteoarthritis as compared with gene 
expression profiles from normal individuals. Expression profiles were generated using 
GeneSpring™ software analysis as described herein. Each column represents the hybridization 
5 pattern resulting from a single individual. In this example, hypertensive patients also presented 
with OA, as described herein. Normal individuals have no known medical conditions and were 
not taking any known medication. Hybridizations to create said gene expression profiles were 
done using the ChondroChip™. A dendogram analysis is shown above. Samples are clustered 
and marked as representing patients who are hypertensive or normal. The "*" indicates those 
10 patients who abnormally clustered as either hypertensive, or normal despite presenting with the 
reverse. The number of hybridizations profiles determined for either hypertensive patients or 
normal individuals are shown. 861 differentially expressed genes were identified as being 
differentially expressed with a p value of < 0.05 as between the hypertensive patients and normal 
individuals. The identity of the differentially expressed genes is shown in Table 3 A. 

15 Classification or class prediction of a test sample as either having hypertension and OA 

or being normal can be done using the differentially expressed genes as shown in Table 3 A in 
combination with well known statistical algorithms for class prediction as would be understood 
by a person skilled in the art and described herein. Commercially available programs such as 
those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

20 

EXAMPLE 10A 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from individuals having osteoarthritis and hypertension as compared with gene expression 
profiles from patients having osteoarthritis only. 

25 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from co-morbid patients with osteoarthritis and hypertension 
as compared to blood samples taken from OA patients only. 
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Blood samples were taken from patients who were diagnosed with osteoarthritis and 
hypertension as defined herein. Gene expression profiles were then analysed and compared to 
profiles from patients having OA only. In each case, the diagnosis of osteoarthritis and/or 
hypertension was corroborated by a skilled Board certified physician . 

5 Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 

using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in blood samples from patients with disease as compared to OA patients only was 
10 determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Expression profiles were generated using GeneSpring™ software analysis as described 
herein (data not shown). The gene list generated from this analysis was identified and those 
genes previously identified in Table 3 A removed so as to identify those genes which are unique 
to hypertension. 790 differentially expressed genes were identified as being differentially 
expressed with a p value of < 0.05 as between the OA and hypertensive patients when compared 
with OA individuals. 577 genes were identified as unique to hypertension. The identity of these 
differentially expressed genes are shown in Table 3P. A gene list is also provided of the 213 
genes which were found in common as between those genes identified in Table 3 A and genes 
differentially expressed in blood samples taken from patients with osteoarthritis and 
hypertension as compared to blood samples taken from OA patients only. The identity of these 
intersecting differentially expressed genes is shown in Table 3Q and a venn diagram showing the 
relationship between the various groups of gene lists is found in Figure 29. 

25 Classification or class prediction of a test sample as having hypertension or not having 

hypertension can be done using the differentially expressed genes as shown in Table 3P as the 
predictor genes in combination with well known statistical algorithms as would be understood by 
a person skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 
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Classification of individuals as having both OA and hypertension using the genes in Table 3Q 
can also be performed. 

EXAMPLE 11 

5 ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 

from co-morbid individuals having osteoarthritis and obesity as compared with gene expression 
profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with obesity and OA as compared to blood 
10 samples taken from healthy patients. 

As used herein, "obesity" is defined as an excess of adipose tissue that imparts a health 
risk. Obesity is assessed in terms of height and weight in the relevance of age. Patients who are 
considered obese include, but are not limited to, patients having a body mass index or BMI 
((defined as body weight in kg divided by (height in meters) 2 ) greater than or equal to 30.0. 

15 Blood samples were taken from patients who were diagnosed with osteoarthritis and 

obesity as defined herein. Gene expression profiles were then analysed and compared to profiles 
from patients unaffected by any disease. In each case, the diagnosis of the disease was 
corroborated by a skilled Board certified physician. Total mRNA from a drop of peripheral 
whole blood taken from each patient was isolated using TRIzol® reagent (GIBCO) and 

20 fluorescently labeled probes for each blood sample were generated as described above. Each 
probe was denatured and hybridized to a 15K Chondrogene Microarray Chip (ChondroChip™) 
as described herein. Identification of genes differentially expressed in blood samples from 
patients with disease as compared to healthy patients was determined by statistical analysis using 
the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, 

25 USA: McGraw-Hill Medical Publishing Division, 2002). 

Figure 9 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as obese as described herein as compared with 



60 



Attorney Docket No.: 423 1/2055C 



gene expression profiles from normal individuals. Expression profiles were generated using 
GeneSpring™ software analysis as described herein. Each column represents the hybridization 
pattern resulting from a single individual. In this example, obese patients also presented with 
OA, as described herein. Normal individuals have no known medical conditions and were not 
5 taking any known medication. Hybridizations to create said gene expression profiles were 
done using the ChondroChip™. A dendogram analysis is shown above. Samples are clustered 
and marked as representing patients who are obese or normal The "*" indicates those patients 
who abnormally clustered as either obese or normal despite presenting with the reverse. The 
number of hybridization profiles determined for obese patients with OA and normal 
10 individuals are shown. 913 genes were identified as being differentially expressed with a p 
value of < 0.05 as between the obese patients with OA and normal individuals is noted. The 
identity of the differentially expressed genes is shown in Table 3B. 

Classification or class prediction of a test sample as either having obesity and OA or 
being normal can be done using the differentially expressed genes as shown in Table 3B in 
15 combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 11A 

20 ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 

from co-morbid individuals having osteoarthritis and obesity as compared with gene expression 
profiles from patients having osteoarthritis only. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with obesity and OA as compared to blood 
25 samples taken from patients with OA only. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
obesity as defined herein. Gene expression profiles were then analysed and compared to profiles 
from patients affected by OA only. 

61 



Attorney Docket No.: 4231/2055C 

In each case, the diagnosis of the disease was corroborated by a skilled Board certified 
physician. Total mRNA from a drop of peripheral whole blood taken from each patient was 
isolated using TRIzol® reagent (GEBCO) and fluorescently labeled probes for each blood sample 
were generated as described above. Each probe was denatured and hybridized to a 15K 
5 Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in blood samples from patients with obesity and OA as compared to OA 
patients only was determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as described 
herein (data not shown). 671 genes were identified as being differentially expressed with a p 
value of < 0.05 as between the obese patients with OA and those patients with only OA. Those 
genes previously identified in Table 3B were removed so as to identify those genes which are 
unique to obesity. The identity of these 519 genes unique to obesity are shown in Table 3R. A 
gene list is also provided of those genes which were found in common as between those genes 
identified in Table 3B and genes differentially expressed in blood samples taken from patients 
with osteoarthritis and obesity as compared to blood samples taken from OA patients only. 152 
genes are shown in Table 3S. A venn diagram showing the relationship between the various 
groups of gene lists is found in Figure 30. 

20 Classification or class prediction of a test sample as having obesity or not having obesity 

can be done using the differentially expressed genes as shown in Table 3R as the predictor genes 
in combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those provided 
by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. Classification 

25 of individuals as having both OA and obesity using the genes in Table 3S can also be performed. 

EXAMPLE 12 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from co-morbid individuals having osteoarthritis and allergies as compared with gene expression 
profiles from normal individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with allergies as compared to blood samples 
taken from healthy patients. 

As used herein, "allergies" encompasses diseases and conditions wherein a patient 
5 demonstrates a hypersensitive or allergic reaction to one or more substances or stimuli such as 
drugs, food stuffs, plants, animals etc. and as a result has an increased immune response. Such 
immune responses can include anaphylaxis, allergic rhinitis, asthma, skin sensitivity such as 
urticaria, eczema, and allergic contact dermatitis and ocular allergies such as allergic 
conjunctivitis and contact allergy. Patients identified as having allergies includes patients having 
10 one or more of the above noted conditions. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
allergies as defined herein. These patients are classified as presenting with co-morbidity, or 
multiple disease states. Gene expression profiles were then analysed and compared to profiles 
from patients unaffected by any disease. In each case, the diagnosis of osteoarthritis and allergies 
15 was corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GEBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
20 expressed in blood samples from patients with osteoarthritis and allergies as compared to healthy 
patients was determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing 
Division, 2002). 

Figure 10 shows a diagrammatic representation of gene expression profiles of blood 
25 samples from individuals who were identified as having allergies as described herein as 
compared with gene expression profiles from normal individuals. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column represents 
the hybridization pattern resulting from a single individual. In this example, patients with 
allergies also presented with OA, as described herein. Normal individuals had no known 
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medical conditions and were not taking any known medication. Hybridizations to create said 
gene expression profiles were done using the ChondroChip™ . A dendogram analysis is 
shown above. Samples are clustered and marked as representing patients who are obese or 
normal. The "*" indicates those patients who abnormally clustered as either having allergies or 
5 being normal despite presenting with the reverse. The number of hybridizations profiles 
determined for patients with allergies and normal individuals are shown. 633 genes were 
identified as being differentially expressed with a p value of < 0.05 as between patients with 
allergies and normal individuals is noted. The identity of the differentially expressed genes is 
shown in Table 3C. 

10 Classification or class prediction of a test sample as either having allergies and OA or 

being normal can be done using the differentially expressed genes as shown in Table 3C in 
combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

15 

EXAMPLE 12A 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from individuals having osteoarthritis (OA) and allergies as compared with gene expression 
profiles from individuals with OA only. 

20 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients with allergies and OA as compared to blood 
samples taken from OA patients. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
allergies as defined herein. Gene expression profiles were then analysed and compared to 
25 profiles from patients affected by OA only. In each case, the diagnosis of osteoarthritis and 
allergies was corroborated by a skilled Board certified physician. 
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Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
5 expressed in blood samples from patients with osteoarthritis and allergies as compared to OA 
patients only was determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as described 
10 herein (data not shown). 498 genes were identified as being differentially expressed with a p 
value of < 0.05 as between patients with allergies and OA as compared with patients with OA 
only. Of the 498 genes identified, those genes previously identified in Table 3C were removed 
so as to identify those genes which are unique to allergies. 257 differentially expressed genes 
were identified as being as unique to allergies. The identity of these differentially expressed 
15 genes are shown in Table 3T. A gene list is also provided of the 241 genes which were found in 
common as between those genes identified in Table 3C and genes differentially expressed in 
blood samples taken from patients with osteoarthritis and allergies as compared to blood samples 
taken from OA patients only. The identity of these intersecting differentially expressed genes is 
shown in Table 3U and a venn diagram showing the relationship between the various groups of 
20 gene lists is found in Figure 3 1 . 

Classification or class prediction of a test sample as having allergies or not having 
allergies can be done using the differentially expressed genes as shown in Table 3T as the 
predictor genes in combination with well known statistical algorithms as would be understood by 
a person skilled in the art and described herein. Commercially available programs such as those 
25 provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 
Classification of individuals as having both OA and allergies using the genes in Table 3U can 
also be performed. 
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EXAMPLE 13 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from co-morbid individuals having osteoarthritis and subject to systemic steroids as compared 
with gene expression profiles from normal individuals 

5 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients subject to systemic steroids as compared to 
blood samples taken from healthy patients. 

As used herein, "systemic steroids" indicates a person subjected to artificial levels of 
steroids as a result of medical intervention. Such systemic steroids include birth control pills, 
10 prednisone, and hormones as a result of hormone replacement treatment. A person identified as 
having systemic steroids is one who is on one or more of the following of the above treatment 
regimes. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
subject to systemic steroids as defined herein. Gene expression profiles were then analysed and 
15 compared to profiles from patients unaffected by any disease. In each case, the diagnosis of 
osteoarthritis and systemic steroids was corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to the 15K 
20 Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in blood samples from patients with osteoarthritis and subject to 
systemic steroids as compared to healthy patients was determined by statistical analysis using the 
Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, 
USA: McGraw-Hill Medical Publishing Division, 2002). 

25 Figure 1 1 shows a diagrammatic representation of gene expression profiles of blood 

samples from individuals who were subject to systemic steroids as described herein as 
compared with gene expression profiles from normal individuals. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column represents 
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the hybridization pattern resulting from a single individual In this example, patients taking 
systemic steroids also presented with OA, as described herein. Normal individuals have no 
known medical conditions and were not taking any known medication. Hybridizations to 
create said gene expression profiles were done using the ChondroChip™ . (A dendogram 
5 analysis is shown above. Samples are clustered and marked as representing patients who are 
taking systemic steroids or normal The "*" indicates those patients who abnormally clustered 
as either systemic steroids or normal despite presenting with the reverse. The number of 
hybridizations profiles determined for patients with systemic steroids and normal individuals 
are shown. 605 genes were identified as being differentially expressed with a p value of < 0.05 
10 as between patients with systemic steroids and normal individuals is noted. The identity of the 
differentially expressed genes is shown in Table 3D. 

Classification or class prediction of a test sample from a patient as indicating said patient 
takes systemic steroids and has OA or as being normal can be done using the differentially 
expressed genes as shown in Table 3A in combination with well known statistical algorithms for 
15 class prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

EXAMPLE 13A 

20 ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 

from co-morbid individuals having osteoarthritis and subject to systemic steroids as compared 
with gene expression profiles from with osteoarthritis only. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients subject to systemic steroids and having OA as 
25 compared to blood samples taken from OA patients only. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
subject to systemic steroids as defined herein. Gene expression profiles were then analysed and 
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compared to profiles from patients having OA only. In each case, the diagnosis of osteoarthritis 
and systemic steroids was corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 

5 generated as described above. Each probe was denatured and hybridized to the 15K 

Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in blood samples from patients with osteoarthritis and subject to 
systemic steroids as compared patients with OA only was determined by statistical analysis using 
the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, 

10 USA: McGraw-Hill Medical Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as described 
herein (data not shown). 553 genes were identified as being differentially expressed with a p 
value of < 0.05 as between patients taking systemic steroids and OA as compared with patients 
with OA only. Of the 553 genes identified, those genes previously identified in Table 3D were 

15 removed so as to identify those genes which are unique to systemic steroids. 362 differentially 
expressed genes were identified as being as unique to systemic steroids. The identity of these 
differentially expressed genes are shown in Table 3 V. A gene list is also provided of the 191 
genes which were found in common as between those genes identified in Table 3D and genes 
differentially expressed in blood samples taken from patients with osteoarthritis and systemic 

20 steroids as compared to blood samples taken from OA patients only. The identity of these 
intersecting differentially expressed genes is shown in Table 3 W and a venn diagram showing 
the relationship between the various groups of gene lists is found in Figure 32. 

Classification or class prediction of a test sample of an individual as either taking 
systemic steroids or not taking systemic steroids can be done using the differentially expressed 
25 genes as shown in Table 3 V as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. Classification of individuals as having 
both OA and taking systemic steroids using the genes in Table 3W can also be performed. 
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EXAMPLE 13B 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from co-morbid individuals having osteoarthritis and subject to systemic steroids as compared 
with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients subject to various specific systemic steroids as 
compared to blood samples taken from healthy patients, and the ability to categorize and 
differentiate as between the systemic steroid being taken. 

As used herein, "systemic steroids" indicates a person subjected to artificial levels of 
steroids as a result of medical intervention. Such systemic steroids include birth control pills, 
prednisone, and hormones as a result of hormone replacement treatment. A person identified as 
having systemic steroids is one who is on one or more of the following of the above treatment 
regimes. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and 
subject to systemic steroids as defined herein. Gene expression profiles were then analysed and 
compared as between the systemic steroids as compared to profiles from patients unaffected by 
any disease. In each case, the diagnosis of osteoarthritis and systemic steroids was corroborated 
by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to the 15K 
Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in blood samples from patients with osteoarthritis and subject to 
systemic steroids as compared to healthy patients was determined by statistical analysis using the 
Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, 
USA: McGraw-Hill Medical Publishing Division, 2002). 
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Figure 34 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were subject to either birth control, prednisone, or hormone 
replacement therapy as described herein as compared with gene expression profiles from 
normal individuals. Expression profiles were generated using GeneSpring™ software analysis 
5 as described herein. Each column represents the hybridization pattern resulting from a single 
individual. In this example, patients taking with each of the systemic steroids also presented 
with OA, as described herein. Normal individuals have no known medical conditions and were 
not taking any known medication. Hybridizations to create said gene expression profiles were 
done using the ChondroChip™. A dendogram analysis is shown above. Samples are clustered 

10 and marked as representing patients who are taking birth control, prednisone, hormone 

replacement therapy or normal. The "*" indicates those patients who abnormally clustered. 
The number of hybridizations profiles determined for patients with birth control, prednisone, 
hormone replacement therapy or normal individuals are shown. 396 genes were identified as 
being differentially expressed with a p value of < 0.05 as between patients with systemic 

15 steroids and normal individuals is noted. The identity of the differentially expressed genes is 
shown in Table 3 AD. 

Classification or class prediction of a test sample from a patient as indicating said patient 
takes systemic steroids and has OA or as being normal can be done using the differentially 
expressed genes as shown in Table 3 AD in combination with well known statistical algorithms 
20 for class prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

EXAMPLE 14 

25 ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 

from individuals having hypertension as compared with gene expression profiles from normal 
individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with hypertension but without osteoarthritis as 
compared to blood samples taken from healthy patients. 

As used herein, the term "hypertension 55 is defined as high blood pressure or elevated 
5 arterial pressure. Patients identified with hypertension herein include persons who have an 
increased risk of developing a morbid cardiovascular event and/or persons who benefit from 
medical therapy designed to treat hypertension. Patients identified with hypertension also can 
include persons having systolic blood pressure of > 130 mm Hg or a diastolic blood pressure of 
>90 mm Hg or a person takes antihypertensive medication. 

10 Blood samples were taken from patients who were diagnosed with hypertension as 

defined herein. Gene expression profiles were then analysed and compared to profiles from 
patients unaffected by any disease. In each case, the diagnosis of hypertension was corroborated 
by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in blood samples from patients with hypertension as compared to healthy patients was 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Figure 12 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals having hypertension as compared with gene expression profiles from 
samples of both non-hypertensive and normal individuals. Expression profiles were generated 
25 using GeneSpring™ software analysis as described herein. Each column represents the 
hybridization pattern resulting from a single individual. Non-hypertensive individuals 
presented without hypertension, but may have presented with other medical conditions and 
may be under various treatment regimes. Normal individuals have no known medical 
conditions and were not taking any known medication. Hybridizations to create said gene 
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expression profiles were done using the ChondroChip™. A dendogram analysis is shown 
above. Samples are clustered and marked as representing patients who are hypertensive, 
normal or non-hypertensive. The "*" indicates those patients who abnormally clustered as 
either hypertensive, non-hypertensive or normal despite actual presentation. The number of 
5 hybridizations profiles determined for hypertensive patients, non-hypertensive patients and 
normal individuals are shown. 1, 993 genes identified as being differentially expressed with a 
p value of < 0.05 as between the hypertensive patients and the combined normal and non- 
hypertensive individuals is noted. The identity of the differentially expressed genes are shown 
in Table 3E. 

10 Classification or class prediction of a test sample of an individual so as to determine 

whether said individual has or does not have hypertension can be done using the differentially 
expressed genes as shown in Table 3E as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

15 GeneSpring™) for Class Predication are also available. 



EXAMPLE 15 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from individuals having obesity as compared with gene expression profiles from normal 
20 individuals 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with obesity but without osteoarthritis as 
compared to blood samples taken from healthy patients. 

As used herein, "obesity" is defined as an excess of adipose tissue that imparts a health 
25 risk. Obesity is assessed in terms of height and weight in the relevance of age. Patients who are 
considered obese include, but are not limited to, patients having a body mass index or BMI 
((defined as body weight in kg divided by (height in meters) 2 ) greater than or equal to 30.0. 
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Blood samples were taken from patients who were diagnosed with hypertension as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
patients unaffected by any disease. In each case, the diagnosis of obesity was corroborated by a 
skilled Board certified physician. 

5 Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 

using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a (ChondroChip™) 
as described herein. Identification of genes differentially expressed in blood samples from 
patients with obesity as compared to healthy patients was determined by statistical analysis using 

10 the Wilcox Mann Whitney rank sum test (Glantz SA., Primer of Biostatistics, 5th ed., New York, 
USA: McGraw-Hill Medical Publishing Division, 2002). 

Figure 13 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as obese as described herein as compared with 
gene expression profiles from normal and non-obese individuals. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column represents 
the hybridization pattern resulting from a single individual. Normal individuals have no 
known medical conditions and were not taking any known medication. Non-obese individuals 
presented without obesity, but may have presented with other medical conditions and may be 
under various treatment regimes. Hybridizations to create said gene expression profiles were 
done using the ChondroChip™ . A dendogram analysis is shown above. Samples are 
clustered and marked as representing patients who are obese, normal or non-obese. The "*" 
indicates those patients who abnormally clustered as either obese, normal or non-obese despite 
actual presentation. The number of hybridizations profiles determined for obese patients, non- 
obese patients and normal individuals are shown. 1,147 genes were identified as being 
differentially expressed with a p value of < 0.05 as between the obese patients and the 
combination of normal and non-obese individuals is noted. The identity of the differentially 
expressed genes is shown in Table 3F. 

Classification or class prediction of a test sample as being obese or not being obese can 
be done using the differentially expressed genes as shown in Table 3F as the predictor genes in 
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combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 16 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from individuals having type 2 diabetes as compared with gene expression profiles from normal 
individuals 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with type 2 diabetes but without osteoarthritis as 
compared to blood samples taken from healthy patients. 

As used herein, "diabetes", or "diabetes mellitus" includes both "type 1 diabetes" 
(insulin-dependent diabetes (IDDM)) and "type 2 diabetes" (insulin-independent diabetes 
(NIDDM). Both type 1 and type 2 diabetes characterized in accordance with Harrison's 
Principles of Internal Medicine 14th edition, as a person having a venous plasma glucose 
concentration > 140mg/dL on at least two separate occasions after overnight fasting and venous 
plasma glucose concentration > 200mg/dL at 2 h and on at least one other occasion during the 2- 
h test following ingestion of 75 g of glucose. Patients identified as having type 2 diabetes as 
described herein are those demonstrating insulin-independent diabetes as determined by the 
methods described above. 

Blood samples were taken from patients who were diagnosed with type II diabetes as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
patients unaffected by any disease. In each case, the diagnosis of type II diabetes was 
corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in blood samples from patients with type 2 diabetes as compared to healthy patients 
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was determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz 
SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Figure 14 shows a diagrammatic representation of gene expression profiles of blood 
5 samples from individuals who were identified as having type 2 diabetes as described herein as 
compared with gene expression profiles from normal and non-type 2 diabetes individuals. 
Expression profiles were generated using GeneSpring™ software analysis as described herein. 
Each column represents the hybridization pattern resulting from a single individual. Normal 
individuals have no known medical conditions and were not taking any known medication. 

10 Non-type 2 diabetes individuals presented without type 2 diabetes, but may have presented 
with other medical conditions and may be under various treatment regimes. Hybridizations to 
create said gene expression profiles were done using the ChondroChip™. A dendogram 
analysis is shown above. Samples are clustered and marked as representing patients who have 
type 2 diabetes, are normal or do not have type 2 diabetes. The indicates those patients 

15 who abnormally clustered despite actual presentation. The number of hybridizations profiles 
determined for type 2 diabetes, non-type 2 diabetes and normal individuals are shown. 915 
were identified as being differentially expressed with a p value of < 0.05 as between the type 2 
diabetes patients and the combination of normal and non type 2 diabetes individuals is noted. 
The identity of the differentially expressed genes is shown in Table 3G. 

20 Classification or class prediction of a test sample of an individual so as to determine 

whether said individual has type 2 diabetes or does not have type 2 diabetes can be done using 
the differentially expressed genes as shown in Table 3G as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon Genetics 

25 (e.g. GeneSpring™) for Class Predication are also available. 
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EXAMPLE 17 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from individuals having hyperlipidemia as compared with gene expression profiles from normal 
individuals. 

5 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients with hyperlipidemia but without osteoarthritis as 
compared to blood samples taken from healthy patients. 

As used herein, "hyperlipidemia" is defined as an elevation of lipid protein profiles and 
includes the elevation of chylomicrons, very low-density lipoproteins (VLDL), intermediate- 

10 density lipoproteins (H)L), low-density lipoproteins (LDL), and/or high-density lipoproteins 
(HDL) as compared with the general population. Hyperlipidemia includes hypercholesterolemia 
and/or hypertriglyceridemia. By hypercholesterolemia, it is meant elevated fasting plasma total 
cholesterol level of>200mg/dL, and/or LDL-cholesterol levels of >130mg/dL. A desirable level 
of HDL-cholesterol is> 60mg/dL. By hypertriglyceridemia it is meant plasma triglyceride (TG) 

15 concentrations of greater than the 90 th or 95 th percentile for age and sex and can include, for 
example, TG > 160mg/dL as determined after an overnight fast. 

Blood samples were taken from patients who were diagnosed with hyperlipidemia as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
patients unaffected by any disease. In each case, the diagnosis of hyperlipidemia was 
20 corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
25 expressed in blood samples from patients with hyperlipidemia as compared to healthy patients 
was determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz 
SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 
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Figure 15 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having hyperlipidemia as described herein as 
compared with gene expression profiles from normal and non-hyperlipidemia patients. 
Expression profiles were generated using GeneSpring™ software analysis as described herein. 

5 Each column represents the hybridization pattern resulting from a single individual Normal 
individuals have no known medical conditions and were not taking any known medication. 
Non hyperlipidemia individuals presented without elevated cholesterol or elevated 
triglycerides but may have presented with other medical conditions and may be under various 
treatment regimes. Hybridizations to create said gene expression profiles were done using the 

10 ChondroChip™. A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who have elevated lipids and/or cholesterol, are normal or do not have 
elevated lipids or cholesterol. The "*" indicates those patients who abnormally clustered as 
having either hyperlipidemia, normal or non-hyperlipidemia despite actual presentation. The 
number of hybridizations profiles determined for hyperlipidemia patients, non-hyperlipidemia 

15 patients and normal individuals are shown. 1 ,022 genes were identified as being differentially 
expressed with a p value of < 0.05 as between the patients with hyperlipidemia and the 
combination of normal and non hyperlipidemia individuals. The identity of the differentially 
expressed genes is shown in Table 3H. 

Classification or class prediction of a test sample of an individual as having 
20 hyperlipidemia or not having hyperlipidemia can be done using the differentially expressed 
genes as shown in Table 3H as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics for Class 
Predication (e.g. GeneSpring™) are also available. 

25 

EXAMPLE 18 

ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 
from individuals having lung disease as compared with gene expression profiles from normal 
individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with lung disease but without osteoarthritis as 
compared to blood samples taken from healthy patients. 

As used herein, "lung disease 55 encompasses any disease that affects the respiratory 
5 system and includes bronchitis, chronic obstructive lung disease, emphysema, asthma, and lung 
cancer. Patients identified as having lung disease includes patients having one or more of the 
above noted conditions. 

Blood samples were taken from patients who were diagnosed with lung disease as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
10 patients unaffected by any disease. In each case, the diagnosis of lung disease was corroborated 
by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
15 Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 

expressed in blood samples from patients with lung disease as compared to healthy patients was j 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

20 Figure 16 shows a diagrammatic representation of gene expression profiles of blood 

samples from individuals who were identified as having lung disease as described herein as 
compared with gene expression profiles from normal and non lung disease individuals. 
Expression profiles were generated using GeneSpring™ software analysis as described herein. 
Each column represents the hybridization pattern resulting from a single individual. Normal 

25 individuals have no known medical conditions and were not taking any known medication. Non- 
lung disease individuals presented without lung disease, but may have presented with other 
medical conditions and may be under various treatment regimes. Hybridizations to create said 
gene expression profiles were done using the ChondroChip™. A dendogram analysis is shown 
above. Samples are clustered and marked as representing patients who have lung disease, are 
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normal or do not have lung disease. The "*" indicates those patients who abnormally clustered 
despite actual presentation. The .number of hybridizations profiles determined for either the lung 
disease patients, non-lung disease patients and normal individuals are show. 596 genes were 
identified as being differentially expressed with a p value of < 0.05 as between the lung disease 
5 patients and the combination of normal and non lung disease individuals is noted. The identity 
of the differentially expressed genes is shown in Table 31. 

Classification or class prediction of a test sample of an individual to determine whether 
said individual has lung disease or does not having lung disease can be done using the 
differentially expressed genes as shown in Table 31 as the predictor genes in combination with 
10 well known statistical algorithms as would be understood by a person skilled in the art and 

described herein. Commercially available programs such as those provided by Silicon Genetics 
(e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 19 

15 Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 

samples from individuals having bladder cancer as compared with gene expression profiles from 
normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with bladder cancer but without osteoarthritis as 
20 compared to blood samples taken from healthy patients. 

As used herein, the term "cancer" or "carcinoma" is defined as a disease in which cells 
behave abnormally and includes; (i) cancers which originate from a single cell proliferating to 
form a clone of malignant cells, (ii) cancers wherein the growth of the cell is not regulated by 
normal biological and physical influences of the environment, (iii) anaplasic cancer, wherein the 
25 cells lack normal coordinated cell differentiation and (iv) metastasis cancer, wherein the cells 
have the capacity for discontinuous growth and dissemination to other parts of the body. The 
diagnosis of cancer can include careful clinical assessment and/or diagnostic investigations 
including endoscopy, imaging, histopathology, cytology and laboratory studies. 
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As used herein, "bladder cancer" includes carcinomas that occur in the transitional 
epithelium lining the urinary tract, starting at the renal pelvis and extending through the ureter, 
the urinary bladder, and the proximal two-thirds of the urethra. As used herein, patients 
diagnosed with bladder cancer include patients diagnosed utilizing any of the following methods 
5 or a combination thereof: urinary cytologic evaluation, endoscopic evaluation for the presence of 
malignant cells, CT (computed tomography), MRI (magnetic resonance imaging) for metastasis 
status. 

Blood samples were taken from patients who were diagnosed with bladder cancer as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
10 patients unaffected by any disease. In each case, the diagnosis of bladder cancer was 
corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
15 Chip as described herein. Identification of genes differentially expressed in blood samples from 
patients with bladder cancer as compared to healthy patients was determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics., 5th 
ed. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Figure 17 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having bladder cancer as described herein as 
compared with gene expression profiles from non bladder cancer individuals. Expression 
profiles were generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. Non bladder cancer 
individuals presented without bladder cancer, but may have presented with other medical 
conditions and may be under various treatment regimes. Hybridizations to create said gene 
expression profiles were done using the Affymetrix U133A chip. A dendogram analysis is 
shown above. Samples are clustered and marked as representing patients who have bladder 
cancer, or do not have bladder cancer. The "*" indicates those patients who abnormally 
clustered as either bladder cancer, or non bladder cancer despite actual presentation. The number 
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of hybridizations profiles determined for patients with bladder cancer and without bladder cancer 
are shown. 4,228 genes were identified as being differentially expressed with a p value of < 0.05 
as between the bladder cancer patients and the non bladder cancer individuals is noted. The 
identity of the differentially expressed genes is shown in Table 3 J. 

5 Classification or class prediction of a test sample of an individual to determine whether 

said individual has bladder cancer or does not having bladder cancer can be done using the 
differentially expressed genes as shown in Table 3 J as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon Genetics 

10 (e.g. GeneSpring™) for Class Predication are also available. 



EXAMPLE 20 

Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 
samples from individuals having early or advanced bladder cancer as compared with gene 
15 expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with early or advanced late stage bladder cancer 
but without osteoarthritis as compared to blood samples taken from healthy patients. 

As used herein, "early stage bladder cancer" includes bladder cancer wherein the 
20 detection of the anatomic extent of the tumor, both in its primary location and in metastatic sites, 
as defined by the TNM staging system in accordance with Harrison's Principles of Internal 
Medicine 14th edition can be considered early stage. More specifically, early stage bladder 
cancer can include those instances wherein the carcinoma is mainly superficial. 

As used herein, "advanced stage bladder cancer" is defined as bladder cancer wherein the 
25 detection of the anatomic extent of the tumor, both in its primary location and in metastatic sites, 
as defined by the TNM staging system in accordance with Harrison's Principles of Internal 
Medicine 14th edition, can be considered as advanced stage. More specifically, advanced stage 
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carcinomas can involve instances wherein the cancer has infiltrated the muscle and wherein 
metastasis has occurred. 

Blood samples were taken from patients who were diagnosed with early or advanced late 
stage bladder cancer as defined herein. Gene expression profiles were then analysed and 
5 compared to profiles from patients unaffected by any disease. In each case, the diagnosis of 
early or advanced late stage bladder cancer was corroborated by a skilled Board certified 
physician . 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 

10 generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
Chip as described herein. Identification of genes differentially expressed in blood samples from 
patients with early or advanced late stage bladder cancer as compared to healthy patients was 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 

15 2002). 

Figure 18 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having advanced stage bladder cancer or early 
stage bladder cancer as described herein as compared with gene expression profiles from non 
bladder cancer individuals. Expression profiles were generated using GeneSpring™ software 

20 analysis as described herein. Each column represents the hybridization pattern resulting from a 
single individual. Non bladder cancer individuals presented without bladder cancer, but may 
have presented with other medical conditions and may be under various treatment regimes. 
Hybridizations to create said gene expression profiles were done using the Affymetrix U1338 
chip. A dendogram analysis is shown above. Samples are clustered and marked as representing 

25 patients who have early stage bladder cancer, advanced stage bladder cancer, or do not have 
bladder cancer. The "*" indicates those patients who abnormally clustered despite actual 
presentation. The number of hybridizations profiles determined for either early stage bladder 
cancer, advanced bladder cancer or non-bladder cancer are shown. 3,518 genes were identified 
as being differentially expressed with a p value of < 0.05 as between the bladder cancer patients 
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and the non bladder cancer individuals is noted. The identity of the differentially expressed genes 
is shown in Table 3K. 

Classification or class prediction of a test sample of an individual to determine whether 
said individual has advanced bladder cancer, early stage bladder cancer or does not have bladder 
cancer can be done using the differentially expressed genes as shown in Table 3K as the 
predictor genes in combination with well known statistical algorithms as would be understood by 
a person skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 21 

Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 
samples from individuals having coronary artery disease as compared with gene expression 
profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with coronary artery disease but without 
osteoarthritis as compared to blood samples taken from healthy patients 

As used herein, "Coronary artery disease" (CAD) is defined as a condition wherein at 
least one coronary artery has >50% luminal diameter stenosis, as diagnosed by coronary 
angiography and includes conditions in which there is atheromatous narrowing and subsequent 
occlusion of the vessel. CAD includes those conditions which manifest as angina, silent 
ischaemia, unstable angina, myocardial infarction, arrhythmias, heart failure, and sudden death. 
Patients identified as having CAD herein Coronary artery disease is defined 

Blood samples were taken from patients who were diagnosed with Coronary artery 
disease as defined herein. Gene expression profiles were then analysed and compared to profiles 
from patients unaffected by any disease. In each case, the diagnosis of Coronary artery disease 
was corroborated by a skilled Board certified physician . 
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Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
Chip as described herein. Identification of genes differentially expressed in blood samples from 
5 patients with Coronary artery disease as compared to healthy patients was determined by 
statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A. Primer of 
Biostatistics., 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Figure 19 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having coronary artery disease (CAD) as 
described herein as compared with gene expression profiles from non-coronary artery disease 
individuals. Expression profiles were generated using GeneSpring™ software analysis as 
described herein. Each column represents the hybridization pattern resulting from a single 
individual. Non coronary artery disease individuals presented without coronary artery disease, 
but may have presented with other medical conditions and may be under various treatment 
regimes. Hybridizations to create said gene expression profiles were done using the 
Affymetrix™ U133A chip. A dendogram analysis is shown above. Samples are clustered and 
marked as representing patients who have coronary artery disease or do not have coronary artery 
disease. The "*" indicates those patients who abnormally clustered despite actual presentation. 
The number of hybridizations profiles determined for patients with CAD or without CAD are 
shown. 967 genes were identified as being differentially expressed with a p value of < 0.05 as 
between the coronary artery disease patients and those individuals without coronary artery 
disease is noted. The identity of the differentially expressed genes is shown in Table 3L. 

Classification or class prediction of a test sample of an individual to determine whether 
said individual has CAD or does not have CAD can be done using the differentially expressed 
25 genes as shown in Table 3L as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics for Class 
Predication (e.g. GeneSpring™) are also available. 
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EXAMPLE 22 

Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 
samples from individuals having Rheumatoid arthritis as compared with gene expression profiles 
from normal individuals. 

5 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients with Rheumatoid arthritis but without 
osteoarthritis as compared to blood samples taken from healthy patients. 

Rheumatoid arthritis (RA) is defined as a chronic, multisystem disease of unknown 
etiology with the characteristic feature of persistent inflammatory synovitis. Said inflammatory 
10 synovitis usually involves peripheral joints in a systemic distribution. Patients having RA as 
defined herein were identified as having one or more of the following; (i)cartilage destruction, 
(ii)bone erosions and/or (iii) joint deformities. 

Blood samples were taken from patients who were diagnosed Rheumatoid arthritis as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
15 patients unaffected by any disease. In each case, the diagnosis of Rheumatoid arthritis was 
corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
20 Chip as described herein. Identification of genes differentially expressed in blood samples from 
patients with Rheumatoid arthritis as compared to healthy patients was determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics., 5th 
ed. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Figure 20 shows a diagrammatic representation of gene expression profiles of blood 
25 samples from individuals who were identified as having rheumatoid arthritis as described herein 
as compared with gene expression profiles from non-rheumatoid arthritis individuals. 
Expression profiles were generated using GeneSpring™ software analysis as described herein. 
Each column represents the hybridization pattern resulting from a single individual. Normal 
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individuals have no known medical conditions and were not taking any known medication. Non 
rheumatoid arthritis individuals presented without rheumatoid arthritis, but may have presented 
with other medical conditions and may be under various treatment regimes. Hybridizations to 
create said gene expression profiles were done using ChondroChip™. A dendogram analysis is 

5 shown above. Samples are clustered and marked as representing patients who have rheumatoid 
arthritis or do not have rheumatoid arthritis. The "*" indicates those patients who abnormally 
clustered despite actual presentation. The number of hybridizations profiles determined for 
patients with rheumatoid arthritis and without rheumatoid arthritis are shown. 2,068 genes were 
identified as being differentially expressed with a p value of < 0.05 as between the rheumatoid 

10 arthritis patients and a combination of those individuals without rheumatoid arthritis and normal 
is noted. The identity of the differentially expressed genes is shown in Table 3M. 

Classification or class prediction of a test sample of an individual as having rheumatoid 
arthritis or not having rheumatoid arthritis can be done using the differentially expressed genes 
as shown in Table 3M as the predictor genes in combination with well known statistical 
15 algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics for Class 
Predication (e.g. GeneSpring™) are also available. 



EXAMPLE 23 

20 Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 

samples from individuals having depression as compared with gene expression profiles from 
normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with depression but without osteoarthritis as 
25 compared to blood samples taken from healthy patients 

As used herein "mood disorders" are conditions characterized by a disturbance in the 
regulation of mood, behavior, and affect. "Mood disorders" can include depression, anxiety, 
schizophrenia, bipolar disorder, manic depression and the like. 
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As used herein "depression" includes depressive disorders or depression in association 
with medical illness or substance abuse in addition to depression as a result of sociological 
situations. Patients defined as having depression were diagnosed mainly on the basis of clinical 
symptoms including a depressed mood episode wherein a person displays a depressed mood on a 
5 daily basis for a period of greater than 2 weeks. A depressed mood episode may be characterized 
by sadness, indifference, apathy, or irritability and is usually associated with changes in a 
number of neurovegetative functions, including sleep patterns, appetite and weight, fatigue, 
impairment in concentration and decision making. 

Blood samples were taken from patients who were diagnosed with depression as defined 
10 herein. Gene expression profiles were then analysed and compared to profiles from patients 
unaffected by any disease. In each case, the diagnosis of depression was corroborated by a 
skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
15 generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
Chip as described herein. Identification of genes differentially expressed in blood samples from 
patients with depression as compared to healthy patients was determined by statistical analysis 
using the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New 
York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Figure 21 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having depression as described herein as 
compared with gene expression profiles from non-depression individuals. Expression profiles 
were generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. Normal individuals have 
no known medical conditions and were not taking any known medication. Non depression 
individuals presented without depression, but may have presented with other medical conditions 
and may be under various treatment regimes. Hybridizations to create said gene expression 
profiles were done using ChondroChip™ . A dendogram analysis is shown above. Samples are 
clustered and marked as representing patients who have depression, having non-depression or 
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normal. The "*" indicates those patients who abnormally clustered despite actual presentation. 
The number of hybridizations profiles determined for patients with depression, non-depression 
and normal are shown. 941 genes were identified as being differentially expressed with a p 
value of < 0.05 as between the patients with depression and a combination of those individuals 
5 without depression and normal is noted. The identity of the differentially expressed genes is 
shown in Table 3N. 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has depression or does not having depression can be done using the differentially 
expressed genes as shown in Table 3N as the predictor genes in combination with well known 
10 statistical algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

EXAMPLE 24 

15 ChondroChip™ Microarray Data Analysis of gene expression profiles of blood samples 

from individuals having osteoarthritis as compared with gene expression profiles from normal 
individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients who were identified as having various stages of 
20 osteoarthritis as compared to blood samples taken from healthy patients. 

Osteoarthritis (OA), as used herein also known as "degenerative joint disease", represents 
failure of a diarthrodial (movable, synovial-lined) joint. It is a condition, which affects joint 
cartilage, and or subsequently underlying bone and supporting tissues leading to pain, stiffness, 
movement problems and activity limitations. It most often affects the hip, knee, foot, and hand, 
25 but can affect other joints as well. 

OA severity can be graded according to the system described by Marshall (Marshall KW. 
J Rheumatol, 1996:23(4) 582-85). Briefly, each of the six knee articular surfaces was assigned a 
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cartilage grade with points based on the worst lesion seen on each particular surface. Grade 0 is 
normal (0 points), Grade I cartilage is soft or swollen but the articular surface is intact (1 point). 
In Grade II lesions, the cartilage surface is not intact but the lesion does not extend down to 
subchondral bone (2 points). Grade III damage extends to subchondral bone but the bone is 
5 neither eroded nor eburnated (3 points). In Grade IV lesions, there is eburnation of or erosion 
into bone (4 points). A global OA score is calculated by summing the points from all six 
cartilage surfaces. If there is any associated pathology, such as meniscus tear, an extra point will 
be added to the global score. Based on the total score, each patient is then categorized into one 
of four OA groups: mild (1-6), moderate (7-12), marked (13-18), and severe (>18). As used 
10 herein, patients identified with OA may be categorized in any of the four OA groupings as 
described above. 

Blood samples were taken from patients who were diagnosed with osteoarthritis and a 
specific stage of osteoarthritis as defined herein. Gene expression profiles were then analysed 
and compared to profiles from patients unaffected by any disease. In each case, the diagnosis of 
15 osteoarthritis and the stage of osteoarthritis was corroborated by a skilled Board certified 
physician . 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K Chondrogene 
20 Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in blood samples from patients with disease as compared to healthy patients was 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics., 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

25 Figure 22 shows a diagrammatic representation of gene expression profiles of blood 

samples from individuals having osteoarthritis as compared with gene expression profiles from 
normal individuals. Expression profiles were generated using GeneSpring™ software analysis 
as described herein. Each column represents the hybridization pattern resulting from a single 
individual. Normal individuals have no known medical conditions and were not taking any 
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known medication. Hybridizations to create said gene expression profiles were done using the 
ChondroChip™. A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who presented with different stages of osteoarthritis or normal. The "*" 
indicates those patients who abnormally clustered despite actual presentation. The number of 
5 hybridizations profiles determined for either osteoarthritis patients or normal individuals are 
shown. 300 differentially expressed genes were identified as being differentially expressed with 
a p value of < 0.05 as between the osteoarthritis patients and normal individuals. The identity of 
the differentially expressed genes is shown in Table 30. 

Classification or class prediction of a test sample of an individual as having OA, having 
10 mild OA, having marked OA, having moderate OA, having severe OA or not having OA can be 
done using the differentially expressed genes as shown in Table 30 as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

15 

EXAMPLE 25 

Microarray Data Analysis of gene expression profiles of blood samples from individuals 
having a condition as compared with gene expression profiles from individuals not having said 
condition, and wherein said individual is undergoing therapeutic treatment in light of said 
20 condition. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from individuals undergoing therapeutic treatment of a 
condition as compared with gene expression profiles from individuals not undergoing treatment. 

Blood samples are taken from patients who are undergoing therapeutic treatment. Gene 
25 expression profiles are then analysed and compared to profiles from patients not undergoing 
treatment. 



90 



Attorney Docket No.: 4231/2055C 



Total mRNA from a drop of peripheral whole blood taken from each patient is isolated 
using TRIzol® reagent (GEBCO) and fluorescently labeled probes for each blood sample are 
generated as described above. Each probe is denatured and hybridized to a microarray for 
example the 15K Chondrogene Microarray Chip (ChondroChip™), Affymetrix Genechip or 
5 Blood chip as described herein. Identification of genes differentially expressed in blood samples 
from patients undergoing therapeutic treatment as compared to patients not undergoing treatment 
is determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A. 
Primer of Biostatistics., differentially expressed genes are then identified as being differentially 
expressed with a p value of < 0.05. 

10 

EXAMPLE 26 

Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 
samples from individuals having liver cancer as compared with gene expression profiles from 
normal individuals. 

15 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients with liver cancer as compared to blood samples 
taken from healthy patients. 

As used herein, "liver cancer" means primary liver cancer wherein the cancer initiates in 
the liver. Primary liver cancer includes both hepatomas or hepatocellular carcinomas (HCC) 
20 which start in the liver and chonalgiomas where cancers develop in the bile ducts of the liver. 

Blood samples were taken from patients who were diagnosed with liver cancer as defined 
herein. Gene expression profiles were then analysed and compared to profiles from patients 
unaffected by any disease. In each case, the diagnosis of liver cancer was corroborated by a 
skilled Board certified physician . 

25 Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 

using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
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Chip as described herein. Identification of genes differentially expressed in blood samples from 
patients with liver cancer as compared to healthy patients was determined by statistical analysis 
using the Weltch t-Test. 

Figure 25 shows a diagrammatic representation of gene expression profiles of blood 
5 samples from individuals who were identified as having liver cancer as described herein as 
compared with gene expression profiles from non-liver cancer disease individuals. Expression 
profiles were generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. Control samples 
presented without liver cancer but may have presented with other medical conditions and may be 
1 0 under various treatment regimes. 

Hybridizations to create said gene expression profiles were done using the Affymetrix™ 
U133A chip. A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who have liver cancer or control. The number of hybridizations profiles 
determined for patients with liver cancer or who are controls are shown. 1,475 genes were 
15 identified as being differentially expressed with a p value of < 0.05 as between the liver cancer 
patients and those control individuals. The identity of the differentially expressed genes is 
shown in Table 3X. 

Classification or class prediction of a test sample of an individual to determine whether 
said individual has liver cancer or does not have liver cancer can be done using the differentially 
20 expressed genes as shown in Table 3X as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 



25 EXAMPLE 27 

Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 
samples from individuals having schizophrenia as compared with gene expression profiles from 
normal individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with schizophrenia as compared to blood 
samples taken from healthy patients. 

As used herein, "schizophrenia" is defined as a psychotic disorders characterized by 
5 distortions of reality and disturbances of thought and language and withdrawal from social 
contact. Patients diagnosed with "schizophrenia" can include patients having any of the 
following diagnosis: an acute schizophrenic episode, borderline schizophrenia, catatonia, 
catatonic schizophrenia, catatonic type schizophrenia, disorganized schizophrenia, disorganized 
type schizophrenia, hebephrenia, hebephrenic schizophrenia, latent schizophrenia, paranoic type 
10 schizophrenia, paranoid schizophrenia, paraphrenia, paraphrenic schizophrenia, psychosis, 
reactive schizophrenia or the like. 

Blood samples were taken from patients who were diagnosed with schizophrenia as 
defined herein. Gene expression profiles were then analysed and compared to profiles from 
patients unaffected by any disease. In each case, the diagnosis of schizophrenia was 
15 corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
Chip as described herein. Identification of genes differentially expressed in blood samples from 
20 patients with schizophrenia as compared to healthy patients was determined by statistical 

analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th 
ed. New York, USA: McGraw-Hill Medical Publishing Division. 

Figure 26 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having schizophrenia as described herein as 
25 compared with gene expression profiles from non schizophrenic individuals. Expression profiles 
were generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. Control samples 
presented without schizophrenia but may have presented with other medical conditions and may 
be under various treatment regimes. Hybridizations to create said gene expression profiles were 
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done using the Affymetrix™ U133A chip. A dendogram analysis is shown above. Samples are 
clustered and marked as representing patients who have schizophrenia or control individuals. 
The number of hybridizations profiles determined for patients with liver cancer or who are 
controls are shown. 1,952 genes were identified as being differentially expressed with a p value 
5 of < 0.05 as between the schizophrenic patients and those control individuals. The identity of the 
differentially expressed genes is shown in Table 3Y. 

Classification or class prediction of a test sample of an individual to determine whether 
said individual has schizophrenia or does not having schizophrenia can be done using the 
differentially expressed genes as shown in Table 3Y as the predictor genes in combination with 
10 well known statistical algorithms as would be understood by a person skilled in the art and 

described herein. Commercially available programs such as those provided by Silicon Genetics 
(e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 28 

15 Affymetrix U133A Chip Microarray Data Analysis of gene expression profiles of blood 

samples from individuals having Chagas disease as compared with gene expression profiles from 
normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients with symptomatic Chagas disease, 
20 asymptomatic Chagas disease or control individuals wherein said control individuals were 
confirmed as not having Chagas disease. 

As used herein, "Chagas disease 55 is defined as a condition wherein an individual is 
infected with the protozoan parasite Trypanosoma cruzi and includes both acute and chronic 
infection. Acute infection with T. cruzi can be diagnosed by detection of parasites by either 
25 microscopic examination of fresh anticoagulated blood or the buffy coat, giemsa-stained thin and 
thick blood smears and/or mouse inoculation and culturing of the blood of a potentially infected 
individual. Even in the absence of a positive result from the above, an accurate determination of 
infection can be made by xenodiagnosis wherein reduviid bugs are allowed to feed on the 
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patient's blood and subsequently the bugs are examined for infection. Chronic infection can be 
determined by detection of antibodies specific to the T. cruzi antigens and/or 
immunoprecipitation and electrophoresis of the T. cruzi antigens. 

As used herein "Symptomatic Chagas disease" includes symptomatic acute chagas and 
5 symptomatic chronic chagas disease. Acute symptomatic chagas disease can be characterized by 
one or more of the following: area of erythema and swelling (a chagoma); local 
lymphadenopathy; generalized lymphadenopathy; mild hepatosplenomegaly; unilateral painless 
edema of the palpebrae and periocular tissues; malaise; fever; anorexia and/or edema of the face 
and lower extremities. Symptomatic chronic Chagas' disease include one or more of the 
10 following symptoms: heart rhythm disturbances, cardiomyopathy, thromboembolism, 

electrocardiographic abnormalities including right bundle-branch blockage; atrioventricular 
block; premature ventricular contractions and tachy- and bradyarrhythmias; dysphagia; 
odynophagia, chest pain; regurgitation; weight loss, cachexia and pulmonary infections. 

As used herein "Asymptomatic Chagas disease" is meant to refer to individuals who are 
15 infected with T. cruzi but who do not show either acute or chronic symptoms of the disease. 

Blood samples were taken from patients who were diagnosed symptomatic or 
asymptomatic Chagas disease as defined herein. Gene expression profiles were then analysed 
and compared to profiles from patients unaffected by any disease. In each case, the diagnosis of . 
Chagas disease was corroborated by a qualified physician. 

20 Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 

using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a Affymetrix U133A 
Chip as described herein. Identification of genes differentially expressed in blood samples from 
patients with Chagas disease as compared to healthy patients was determined by statistical 

25 analysis using the Weltch ANOVA test (Michelson and Schofield, 1996). 

Figure 27 shows a diagrammatic representation of gene expression profiles of blood 
samples from individuals who were identified as having symptomatic Chagas disease; 
asymptomatic Chagas disease or who were control individuals as described herein as compared 
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with gene expression profiles from non schizophrenic individuals. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column represents 
the hybridization pattern resulting from a single individual. Control samples presented without 
Chagas disease but may have presented with other medical conditions and may be under various 
5 treatment regimes. 

Hybridizations to create said gene expression profiles were done using the Affymetrix™ 
U133A chip. A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who have symptomatic chagas disease; asymptomatic chagas disease or 
control. The number of hybridizations profiles determined for patients with chagas disease; 
10 asymptomatic chagas disease or who are controls are shown. 668 genes were identified as being 
differentially expressed with a p value of < 0.05 as between the symptomatic, asymptomatic 
Chagas patients and those control individuals. The identity of the differentially expressed genes 
is shown in Table 3Y. 

Classification or class prediction of a test sample of an individual to determine whether 
15 said individual has symptomatic Chagas disease, asymptomatic Chagas disease or does not have 
Chagas disease can be done using the differentially expressed genes as shown in Table 3 Y as the 
predictor genes in combination with well known statistical algorithms as would be understood by 
a person skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

20 

EXAMPLE 29 

Identification of Genes Specific for OA Only by Removing Genes Relevant to Co- 
Morbidities and Other Disease States. 

This example demonstrates the use of the claimed invention to detect differential gene 
25 expression in blood unique to Osteoarthritis as compared with other disease states. 

Blood samples were taken from patients who were diagnosed with mild OA or severe OA 
and compared with individuals who were identified as normal individuals as defined herein. 
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Gene expression profiles were then analysed to identify genes which are differentially expressed 
in OA as compared with normal. In each case, the diagnosis of OA was corroborated by a 
qualified physician. 

Total mRNA from a drop of peripheral whole blood taken from each patient was isolated 
5 using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a ChondroChip™ as 
described herein. Identification of genes differentially expressed in blood samples from patients 
with mild or severe OA as compared to healthy patients was determined by statistical analysis 
using the Weltch ANOVA test (Michelson and Schofield, 1996). (Dendogram analysis not 
10 shown). 

In order to identify genes differentially expressed in blood unique to OA but not 
differentially expressed as a result of possible co-morbidities including hypertension, obesity, 
asthma, taking systemic steroids, or allergies, genes identified as differentially expressed in both 
OA and any of the genes identified as differentially expressed as a result of co-morbidity, e.g., 
Table 3 A (co-morbidity of OA and hypertension v. normal), Table 3B (co-morbidity of OA and 
obesity v. normal), Table 3C (co-morbidity of OA and allergy v. normal), Table 3D (co- 
morbidity of OA and taking systemic steroids v. normal), and genes in common with people 
identified as having asthma and OA (Table 3 AA) were removed. Similarly any genes and 
unique to obesity (Table 3R), hypertension (Table 3P), allergies (Table 3T), systemic steroids 
(Table 3 V) were also removed. As a result of these comparisons, a list of genes unique to 
individuals with OA were identified. The identity of the differentially expressed genes is shown 
in Table 3 AB. 

It would be clear to a person skilled in the art that rather than simply remove those genes 
which are relevant to other disease states, one could use a more refined analysis and remove 
25 those genes which show the same trend in gene expression, e.g. remove those genes which show 
up regulation in a co-morbid state and also show up-regulation in the single disease state, but 
retain those genes which show a different trend in gene expression e.g. retain those genes which 
show up regulation in a co-morbid state as compared to down regulation in a single disease state. 
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Classification or class prediction of a test sample of an individual to determine whether 
said individual has OA or does not have OA can be done using the differentially expressed genes 
as shown in Table 3AB, irrespective of whether the individual presents with co-morbidity using 
well known statistical algorithms as would be understood by a person skilled in the art and 
5 described herein. Commercially available programs such as those provided by Silicon Genetics 
(e.g. GeneSpring™) for Class Predication are also available. 



EXAMPLE 30 

Analysis of gene expression profiles of blood samples from individuals having brain 
10 cancer as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients diagnosed with brain cancer as compared to 
blood samples taken from healthy patients. 

As used herein "brain cancer" refers to all forms of primary brain tumors, both 
15 intracranial and extracranial and includes one or more of the following: Glioblastoma, 

Ependymoma, Gliomas, Astrocytoma, Medulloblastoma, Neuroglioma, Oligodendroglioma, 
Meningioma, Retinoblastoma, and Craniopharyngioma. 

Blood samples are taken from patients diagnosed with brain cancer as defined herein. 
Gene expression profiles are then analysed and compared to profiles from patients unaffected by 
20 any disease. Preferably healthy patients are chosen who are age and sex matched to said patients 
diagnosed with disease. In each case, the diagnosis of brain cancer is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
25 are generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with brain cancer as compared to healthy patients is 
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determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Classification or class prediction of a test sample of an individual to determine whether 
5 said individuals has brain cancer or does not having brain cancer can be done using the 
differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

10 

EXAMPLE 31 

Analysis of gene expression profiles of blood samples from individuals having 
ankylosing spondylitis as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
15 expression in blood samples taken from patients diagnosed with ankylosing spondylitis as 
compared to blood samples taken from healthy patients. 

As used herein "ankylosing spondylitis" refers to a chronic inflammatory disease that 
affects the joints between the vertebrae of the spine, and/or the joints between the spine and the 
pelvis and can eventually cause the affected vertebrae to fuse or grow together. 

20 Blood samples are taken from patients diagnosed with ankylosing spondylitis as defined 

herein. Gene expression profiles are then analysed and compared to profiles from patients 
unaffected by any disease. Preferably healthy patients are chosen who are age and sex matched 
to said patients diagnosed with disease. In each case, the diagnosis of ankylosing spondylitis is 
corroborated by a skilled Board certified physician. 

25 Total mRNA from a drop of peripheral whole blood is taken from each patient and 

isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
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is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with ankylosing spondylitis as compared to healthy 
patients is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
5 (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing 
Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has ankylosing spondylitis or does not having ankylosing spondylitis can be 
done using the differentially expressed genes identified as described above as the predictor genes 
10 in combination with well known statistical algorithms as would be understood by a person 

skilled in the art and described herein. Commercially available programs such as those provided 
by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 32 

15 Analysis of gene expression profiles of blood samples from individuals having prostate 

cancer as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients diagnosed with prostate cancer as compared to 
blood samples taken from healthy patients 

20 As used herein "prostate cancer" refers to a malignant cancer originating within the 

prostate gland. Patients identified as having prostate cancer can have any stage of prostate 
cancer, as determined clinically (by digital rectal exam or PSA testing) and or pathologically. 
Staging of prostate cancer can done in accordance with TNM or the Staging System of the 
American Joint Committee on Cancer (AJCC). In addition to the TNM system, other systems 

25 may be used to stage prostate cancer, for example, the Whitmore-Jewett system. 

Blood samples are taken from patients diagnosed with prostate cancer as defined herein. 
Gene expression profiles are then analysed and compared to profiles from patients unaffected by 
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any disease to identify genes which differentiate as between the two groups. Similarly gene 
expression profiles can be analysed so as to differentiate as between the severity of the prostate 
cancer. Preferably healthy patients are chosen who are age and sex matched to said patients 
diagnosed with disease or with a specific stage of said disease. In each case, the diagnosis of 
5 prostate cancer is corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
10 expressed in blood samples from patients with prostate cancer as compared to healthy patients is 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Classification or class prediction of a test sample of an individual to determine whether 
15 said individuals has prostate cancer, has a specific stage of prostate cancer, or does not having 
prostate cancer can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available programs 
such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
20 available. 

EXAMPLE 33 

Analysis of gene expression profiles of blood samples from individuals having ovarian 
cancer as compared with gene expression profiles from normal individuals. 

25 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients diagnosed with ovarian cancer as compared to 
blood samples taken from healthy patients. 
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As used herein "ovarian cancer" refers to a malignant cancerous growth originating 
within the ovaries. Patients identified as having ovarian cancer can have any stage of ovarian 
cancer. Staging is done by combining information from imaging tests with the results of a 
surgical examination done during a laprotomy. Numbered stages I to IV are used to describe the 
5 extent of the cancer and whether it has spread (metastasized) to more distant organs. 

Blood samples are taken from patients diagnosed with ovarian cancer, or with a specific 
stage of ovarian cancer as defined herein. Gene expression profiles are then analysed and 
compared to profiles from patients unaffected by any disease. Preferably healthy patients are 
chosen who are age and sex matched to said patients diagnosed with disease or with a specific 
10 stage of said disease. In each case, the diagnosis of ovarian cancer is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
15 Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 

expressed in blood samples from patients with ovarian cancer and or a specific stage of ovarian 
cancer as compared to healthy patients is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has ovarian cancer, has a specific stage of ovarian cancer or does not having 
ovarian cancer can be done using the differentially expressed genes identified as described above 
as the predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available programs 
such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
available. 
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EXAMPLE 34 

Analysis of gene expression profiles of blood samples from individuals having kidney 
cancer as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
5 expression in blood samples taken from patients diagnosed with kidney cancer as compared to 
blood samples taken from healthy patients. 

As used herein "kidney cancer" refers to a malignant cancerous growth originating within 
the kidneys. Kidney cancer includes renal cell carcinoma, transitional cell carcinoma, and 
Wilms' tumor. Patients identified as having renal cell carcinoma can also be categorized by 
10 stage of said cancer as determined by the System of the American Joint Committee on Cancer 
(AJCC). Numbered stages I to IV are used to describe the extent of the carcinoma and whether it 
has spread (metastased) to more distant organs. 

Blood samples are taken from patients diagnosed with kidney cancer, or with a specific 
stage of renal cell carcinoma as defined herein. Gene expression profiles are then analysed and 
15 compared to profiles from patients unaffected by any disease. Preferably healthy patients are 
chosen who are age and sex matched to said patients diagnosed with disease or with a specific 
stage of said disease. In each case, the diagnosis of kidney cancer is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with kidney cancer and or a specific stage of kidney 
cancer as compared to healthy patients is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has kidney cancer, has a specific stage of kidney cancer or does not having 
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kidney cancer can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication 
5 are also available. 

EXAMPLE 35 

Analysis of gene expression profiles of blood samples from individuals having gastric 
cancer as compared with gene expression profiles from normal individuals. 

10 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients diagnosed with gastric cancer as compared to 
blood samples taken from healthy patients. 

As used herein "gastric or stomach cancer" refers to a cancerous growth originating 
within the stomach and includes gastric adenocarcinoma, primary gastric lymphoma and gastric 
15 nonlymphoid sarcoma. Patients identified as having stomac can also be categorized by stage of 
said cancer as determined by the System of the American Joint Committee on Cancer (AJCC). 

Blood samples are taken from patients diagnosed with stomach cancer, or with a specific 
stage of stomach cancer as defined herein. Gene expression profiles are then analysed and 
compared to profiles from patients unaffected by any disease. Preferably healthy patients are 
20 chosen who are age and sex matched to said patients diagnosed with disease or with a specific 
stage of said disease. In each case, the diagnosis of stomach cancer is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
25 is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with stomach cancer and or a specific stage of stomach 
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cancer as compared to healthy patients is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
5 said individuals has stomach cancer, has a specific stage of stomach cancer or does not having 
stomach cancer can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication 
10 are also available. 

EXAMPLE 36 

Analysis of gene expression profiles of blood samples from individuals having lung 
cancer as compared with gene expression profiles from normal individuals. 

15 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients diagnosed with lung cancer as compared to 
blood samples taken from healthy patients. 

As used herein "lung cancer" refers to a cancerous growth originating within the lung and 
includes adenocarcinoma, alveolar cell carcinoma, squamous cell carcinoma, large cell and small 
20 cell carcinomas. Patients identified as having lung cancer can also be categorized by stage of 
said cancer as determined by the System of the American Joint Committee on Cancer (AJCC). 

Blood samples are taken from patients diagnosed with lung cancer, or with a specific 
stage of lung cancer as defined herein. Gene expression profiles are then analysed and compared 
to profiles from patients unaffected by any disease. Preferably healthy patients are chosen who 
25 are age and sex matched to said patients diagnosed with disease or with a specific stage of said 
disease. In each case, the diagnosis of lung cancer is corroborated by a skilled Board certified 
physician. 
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Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
5 expressed in blood samples from patients with lung cancer and or a specific stage of lung cancer 
as compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw- 
Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
10 said individuals has lung cancer, has a specific stage of lung cancer or does not having lung 
cancer can be done using the differentially expressed genes identified as described above as the 
predictor genes in combination with well known statistical algorithms as would be understood 
by a person skilled in the art and described herein. Commercially available programs such as 
those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available 

15 

EXAMPLE 37 

Analysis of gene expression profiles of blood samples from individuals having breast 
cancer as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
20 expression in blood samples taken from patients diagnosed with breast cancer as compared to 
blood samples taken from healthy patients. 

As used herein "breast cancer" refers to a cancerous growth originating within the breast 
and includes invasive and non invasive breast cancer such as ductal carcinoma in situ (DCIS), 
lobular carcinoma in situ (LCIS), infiltrating ductal carcinoma, and infiltrating lobular 
25 carcinoma. Patients identified as having breast cancer can also be categorized by stage of said 
cancer as determined by the System of the American Joint Committee on Cancer (AJCC) or 
TNM classification. 
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Blood samples are taken from patients diagnosed with breast cancer, or with a specific 
stage of breast cancer as defined herein. Gene expression profiles are then analysed and 
compared to profiles from patients unaffected by any disease. Preferably healthy patients are 
chosen who are age and sex matched to said patients diagnosed with disease or with a specific 
5 stage of said disease. In each case, the diagnosis of breast cancer is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
10 Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with breast cancer and or a specific stage of breast 
cancer as compared to healthy patients is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

15 Classification or class prediction of a test sample of an individual to determine whether 

said individuals has breast cancer, has a specific stage of breast cancer or does not have breast 
cancer can be done using the differentially expressed genes identified as described above as the 
predictor genes in combination with well known statistical algorithms as would be understood 
by a person skilled in the art and described herein. Commercially available programs such as 

20 those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
available. 

EXAMPLE 38 

Analysis of gene expression profiles of blood samples from individuals having 
25 nasopharyngeal cancer as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients diagnosed with nasopharyngeal cancer as 
compared to blood samples taken from healthy patients. 
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As used herein "nasopharyngeal cancer" refers to a cancerous growth arising from the 
epithelial cells that cover the surface and line the nasopharynx. Patients identified as having 
nasopharyngeal cancer can also be categorized by stage of said cancer as determined by the 
System of the American Joint Committee on Cancer (AJCC) or TNM classification. 

5 Blood samples are taken from patients diagnosed with nasopharyngeal cancer, or with a 

specific stage of nasopharyngeal cancer as defined herein. Gene expression profiles are then 
analysed and compared to profiles from patients unaffected by any disease. Preferably healthy 
patients are chosen who are age and sex matched to said patients diagnosed with disease or with 
a specific stage of said disease. In each case, the diagnosis of nasopharyngeal cancer is 
10 corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
15 expressed in blood samples from patients with nasopharyngeal cancer and or a specific stage of 
breast cancer as compared to healthy patients is determined by statistical analysis using the 
Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, 
USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
20 said individuals has nasopharyngeal cancer, has a specific stage of nasopharyngeal cancer or 
does not have nasopharyngeal cancer can be done using the differentially expressed genes 
identified as described above as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
25 GeneSpring™) for Class Predication are also available. 
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EXAMPLE 39 

Analysis of gene expression profiles of blood samples from individuals having Guillain 
Barre syndrome as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
5 expression in blood samples taken from patients diagnosed with Guillain Barre syndrome as 
compared to blood samples taken from healthy patients. 

As used herein "Guillain Barre syndrome" refers to an acute, usually rapidly progressive 
form of inflammatory polyneuropathy characterized by muscular weakness and mild distal 
sensory loss. 

10 Blood samples are taken from patients diagnosed with Guillain Barre syndrome as 

defined herein. Gene expression profiles are then analysed and compared to profiles from 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and sex 
matched to said patients diagnosed with disease. In each case, the diagnosis of Guillain Barre 
syndrome is corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with Guillain Barre syndrome as compared to healthy 
patients is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing 
Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has Guillain Barre syndrome, or does not have Guillain Barre syndrome can be 
25 done using the differentially expressed genes identified as described above as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 
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EXAMPLE 40 

Analysis of gene expression profiles of blood samples from individuals having 
Fibromyalgia as compared with gene expression profiles from normal individuals. 

5 This example demonstrates the use of the claimed invention to detect differential gene 

expression in blood samples taken from patients diagnosed with Fibromyalgia as compared to 
blood samples taken from healthy patients. 

As used herein "Fibromyalgia" refers to widespread chronic musculoskeletal pain and 
fatigue. The pain comes from the connective tissues, such as the muscles, tendons, and 
10 ligaments and does not involve the joints. Blood samples are taken from patients diagnosed with 
Fibromyalgia as defined herein. Gene expression profiles are then analysed and compared to 
profiles from patients unaffected by any disease. Preferably healthy patients are chosen who are 
age and sex matched to said patients diagnosed with disease. In each case, the diagnosis of 
Fibromyalgia is corroborated by a skilled Board certified physician. 

15 Total mRNA from a drop of peripheral whole blood is taken from each patient and 

isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with Fibromyalgia as compared to healthy patients is 

20 determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has Fibromyalgia, or does not have Fibromyalgia can be done using the 
25 differentially expressed genes identified as described above as the predictor genes in 

combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 
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EXAMPLE 41 

Analysis of gene expression profiles of blood samples from individuals having Multiple 
Sclerosis as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
5 expression in blood samples taken from patients diagnosed with Multiple Sclerosis as compared 
to blood samples taken from healthy patients. 

As used herein "Multiple Sclerosis" refers to chronic progressive nervous disorder 
involving the loss of myelin sheath surrounding certain nerve fibres. Blood samples are taken 
from patients diagnosed with Multiple Sclerosis as defined herein. Gene expression profiles are 
10 then analysed and compared to profiles from patients unaffected by any disease. Preferably 

healthy patients are chosen who are age and sex matched to said patients diagnosed with disease. 
In each case, the diagnosis of Multiple Sclerosis is corroborated by a skilled Board certified 
physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with Multiple Sclerosis as compared to healthy patients 
is determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, 
Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 
2002). ■ 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has Multiple Sclerosis, or does not have Multiple Sclerosis can be done using 
the differentially expressed genes identified as described above as the predictor genes in 
25 combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 
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EXAMPLE 42 

Analysis of gene expression profiles of blood samples from individuals having Muscular 
Dystrophy as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients diagnosed with Muscular Dystrophy as 
compared to blood samples taken from healthy patients. 

As used herein "Muscular Dystrophy" refers to a hereditary disease of the muscular 
system characterized by weakness and wasting of the skeletal muscles. Muscular Dystrophy 
includes Duchennes' Muscular Dystrophy, limb-girdle muscular dystrophy, myotonia atrophica, 
myotonic muscular dystrophy, pseudohypertrophic muscular dystrophy, and Steinhardt's disease. 

Blood samples are taken from patients diagnosed with Muscular Dystrophy as defined 
herein. Gene expression profiles are then analysed and compared to profiles from patients 
unaffected by any disease. Preferably healthy patients are chosen who are age and sex matched 
to said patients diagnosed with disease. In each case, the diagnosis of Muscular Dystrophy is 
corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with Muscular Dystrophy as compared to healthy 
patients is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA,. Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing 
Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has Muscular Dystrophy, or does not have Muscular Dystrophy can be done 
using the differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person skilled 
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in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

EXAMPLE 43 

5 Analysis of gene expression profiles of blood samples from individuals having septic 

joint arthroplasty as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients diagnosed with septic joint arthroplasty as 
compared to blood samples taken from healthy patients. 

10 As used herein "septic joint arthroplasty" refers to an inflammation of the joint caused by 

a bacterial infection. 

Blood samples are taken from patients diagnosed with septic joint arthroplasty as defined 
herein. Gene expression profiles are then analysed and compared to profiles from patients 
unaffected by any disease. Preferably healthy patients are chosen who are age and sex matched 
15 to said patients diagnosed with disease. In each case, the diagnosis of septic joint arthroplasty is 
corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
20 Chip and/or a ChondroChip™ as described herein. I dentification of genes differentially 

expressed in blood samples from patients with septic joint arthroplasty as compared to healthy 
patients is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing 
Division, 2002). 

25 Classification or class prediction of a test sample of an individual to determine whether 

said individuals has septic joint arthroplasty, or does not have septic joint arthroplasty can be 
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done using the differentially expressed genes identified as described above as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

5 

EXAMPLE 44 

Analysis of gene expression profiles of blood samples from individuals having 
Alzheimers Disease as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
10 expression in blood samples taken from patients diagnosed with Alzheimers as compared to 
blood samples taken from healthy patients. 

As used herein "Alzheimers" refers to a degenerative disease of the central nervous 
system characterized especially by premature senile mental deterioration. 

Blood samples are taken from patients diagnosed with Alzheimers as defined herein. 
15 Gene expression profiles are then analysed and compared to profiles from patients unaffected by 
any disease. Preferably healthy patients are chosen who are age and sex matched to said patients 
diagnosed with disease. In each case, the diagnosis of Alzheimers is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
20 isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with Alzheimers as compared to healthy patients is 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, 
25 Primer of Biostatistics, 5th ed. 5 New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 
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Classification or class prediction of a test sample of an individual to determine whether 
said individuals has Alzheimers, or does not have Alzheimers can be done using the 
differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person skilled 
5 in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 



EXAMPLE 45 

Analysis of gene expression profiles of blood samples from individuals having hepatitis 
10 as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect gene expression in 
blood samples taken from patients diagnosed with hepatitis as compared to blood samples taken 
from healthy patients. 

As used herein "hepatitis" refers to an inflammation of the liver caused by a virus or 
15 toxin and can include hepatitis A, hepatitis B, hepatitis C, hepatitis D, hepatitis E, and hepatitis 
F. 

Blood samples are taken from patients diagnosed with hepatitis as defined herein. Gene 
expression profiles are then analysed and compared to profiles from patients unaffected by any 
disease. Preferably healthy patients are chosen who are age and sex matched to said patients 
20 diagnosed with disease. In each case, the diagnosis of hepatitis is corroborated by a skilled 
Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
25 Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with hepatitis as compared to healthy patients is 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, 
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Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has hepatitis, or does not have hepatitis can be done using the differentially 
5 expressed genes identified as described above as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 



10 EXAMPLE 46 

Analysis of gene expression profiles of blood samples from individuals having Manic 
Depression Syndrome (MDS) as compared with gene expression profiles from normal 
individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
15 expression in blood samples taken from patients diagnosed with MDS as compared to blood 
samples taken from healthy patients. 

As used herein "Manic Depression Syndrome (MDS)" refers to a mood disorder 
characterized by alternating mania and depression. 

Blood samples are taken from patients diagnosed with MDS as defined herein. Gene 
20 expression profiles are then analysed and compared to profiles from patients unaffected by any 
disease. Preferably healthy patients are chosen who are age and sex matched to said patients 
diagnosed with disease. In each case, the diagnosis of MDS is corroborated by a skilled Board 
certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
25 isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
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Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with MDS as compared to healthy patients is 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, 
Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 
5 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has MDS, or does not have MDS can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
10 described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 



EXAMPLE 47 

Analysis of gene expression profiles of blood samples from individuals having Crohn's 
15 Disease and/or Colitis as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from patients diagnosed with Crohn's Disease and/or Colitis 
as compared to blood samples taken from healthy patients. 

As used herein "Crohn's Disease" refers to a chronic inflammation of the ileum which is 
20 often progressive. As used herein "Colitis" or "Inflammatory Bowel Disease" refers to 
inflammation of the colon. 

Blood samples are taken from patients diagnosed with Crohn's and or Colitis as defined 
herein. Gene expression profiles are then analysed and compared to profiles from patients 
unaffected by any disease. Preferably healthy patients are chosen who are age and sex matched 
25 to said patients diagnosed with disease. In each case, the diagnosis of Crohn's and or Colitis is 
corroborated by a skilled Board certified physician. 
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Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
5 expressed in blood samples from patients with Crohn's and or Colitis as compared to healthy 
patients is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing 
Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
10 said individuals has Crohn's and or Colitis, or does not have Crohn's and or Colitis can be 
done using the differentially expressed genes identified as described above as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

15 

EXAMPLE 48 

Analysis of gene expression profiles of blood samples from individuals having Malignant 
Hyperthermia Susceptibility as compared with gene expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
20 expression in blood samples taken from patients diagnosed with Malignant Hyperthermia 
Susceptibility as compared to blood samples taken from healthy patients. 

As used herein "Malignant Hyperthermia Susceptibility" refers to a pharmacogenetic 
disorder of skeletal muscle calcium regulation often developing during or after a general 
anaesthesia. 

25 Blood samples are taken from patients diagnosed with Malignant Hyperthermia 

Susceptibility as defined herein. Gene expression profiles are then analysed and compared to 
profiles from patients unaffected by any disease. Preferably healthy patients are chosen who are 

118 



Attorney Docket No.: 4231/2055C 



age and sex matched to said patients diagnosed with disease. In each case, the diagnosis of 
Malignant Hyperthermia Susceptibility is corroborated by a skilled Board certified physician. 

Total mRNA from a drop of peripheral whole blood is taken from each patient and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample 
5 is generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. Identification of genes differentially 
expressed in blood samples from patients with Malignant Hyperthermia Susceptibility as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw- 
10 Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether 
said individuals has Malignant Hyperthermia Susceptibility, or does not have Malignant 
Hyperthermia Susceptibility can be done using the differentially expressed genes identified as 
described above as the predictor genes in combination with well known statistical algorithms 
15 as would be understood by a person skilled in the art and described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 

EXAMPLE 49 

20 Analysis of gene expression profiles of blood samples from horses having osteoarthritis 

as compared with gene expression profiles from normal or non-osteoarthritic horses. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in blood samples taken from horses so as to diagnose equine arthritis as compared to 
blood samples taken from healthy horses. 

25 As used herein "arthritis" in reference to horses refers to a degenerative joint disease that 

affects horses by causing lameness. Although it can appear in any joint, most common areas are 
the upper knee joint, front fetlocks, hocks, or coffin joints in the front feet. The condition can be 
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caused by trauma, mineral or dietary deficiency, old age, poor conformation, over exertion or 
infection. The different structures that can be damaged in arthritis are the cartilage inside joints, 
the bone in the joints, the joint capsule, the synovial membranes, the ligaments around the joints 
and lastly the fluid that lubricates the insides of 'synovial joints'. In severe cases all of these 
5 structures are affected. In for example osteochondrosis only the cartilage may be affected. 

Regardless of the cause, the disease begins when the synovial fluid that lubricates healthy 
joints begins to thin. The decrease in lubrication causes the cartilage cushion to break down, and 
eventually the bones begin to grind painfully against each other. Diagnostic tests used to 
confirm arthritis include X-rays, joint fluid analysis, and ultrasound. 

10 Blood samples are taken from horses diagnosed with arthritis as defined herein. Gene 

expression profiles are then analysed and compared to profiles from horses unaffected by any 
disease. Preferably healthy horses are chosen who are age and sex matched to said horses 
diagnosed with disease. In each case, the diagnosis of arthritis is corroborated by a certified 
veterinarian. 

Total mRNA from a drop of peripheral whole blood is taken from each horse and isolated 
using TRIzol® reagent (GDBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. An equine specific microarray representing 
the equine genome can also be used. Identification of genes differentially expressed in blood 
samples from horses with arthritis as compared to healthy horses is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th 
ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of a horse to determine whether said 
horse has arthritis or does not have arthritis can be done using the differentially expressed 
25 genes identified as described above as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 
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EXAMPLE 50 

Analysis of gene expression profiles of blood samples from dogs having osteoarthritis as 
compared with gene expression profiles from normal or non-osteoarthritic dogs. 

This example demonstrates the use of the claimed invention to detect differential gene 
5 expression in blood samples taken from dogs so as to diagnose equine arthritis as compared to 
blood samples taken from healthy horses. 

As used herein "osteoarthritis" in reference to dogs is a form of degenerative joint disease 
which involves the deterioration of and changes to the cartilage and bone. In response to 
inflammation in and about the joint, the body responds with bony remodeling around the joint 
10 structure. This process can be slow and gradual with minimal outward symptoms, or more 
rapidly progressive with significant pain and discomfort. Osteoarthritic changes can occur in 
response to infection and injury of the joint as well. 

Blood samples are taken from dogs diagnosed with osteoarthritis as defined herein. Gene 
expression profiles are then analysed and compared to profiles from dogs unaffected by any 
15 disease. Preferably healthy dogs are chosen who are age, sex and breed matched to said dogs 
diagnosed with disease. In each case, the diagnosis of osteoarthritis is corroborated by a certified 
veterinarian. 

Total mRNA from a drop of peripheral whole blood is taken from each dog and isolated 
using TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix U133A 
Chip and/or a ChondroChip™ as described herein. A canine specific microarray representing 
the canine genome can also be used. Identification of genes differentially expressed in blood 
samples from dogs with osteoarthritis as compared to healthy horses is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th 
ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of a dog to determine whether said 
dog has osteoarthritis or does not have osteoarthritis can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with well 
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known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

5 EXAMPLE 51 

Analysis of gene expression profiles of blood samples from individuals having Manic 
Depression Syndrome (MDS) as compared with gene expression profiles from individuals 
having Schizophrenia. 

This example demonstrates the use of the claimed invention to detect differential gene 
10 expression in blood samples taken from patients diagnosed with MDS as compared to blood 
samples taken from schizophrenic patients. 

As used herein "Manic Depression Syndrome (MDS)" refers to a mood disorder 
characterized by alternating mania and depression. As used herein, "schizophrenia" is defined as 
a psychotic disorders characterized by distortions of reality and disturbances of thought and 
language and withdrawal from social contact. Patients diagnosed with "schizophrenia" can 
include patients having any of the following diagnosis: an acute schizophrenic episode, 
borderline schizophrenia, catatonia, catatonic schizophrenia, catatonic type schizophrenia, 
disorganized schizophrenia, disorganized type schizophrenia, hebephrenia, hebephrenic 
schizophrenia, latent schizophrenia, paranoic type schizophrenia, paranoid schizophrenia, 
paraphrenia, paraphrenic schizophrenia, psychosis, reactive schizophrenia or the like. 

Blood samples are taken from patients diagnosed with MDS or Schizophrenia as defined 
herein. Gene expression profiles are then analyzed and compared to profiles from patients 
unaffected by any disease. Preferably healthy patients are chosen who are age and sex matched 
to said patients diagnosed with disease. In each case, the diagnosis of MDS and Schizophrenia is 
25 corroborated by a skilled Boardcertified physician. Total mRNA from a drop of peripheral 
whole blood is taken from each patient and isolated using TRIzol* reagent (GIBCO) and 
fluorescently labeled probes for each blood sample is generated as described above. 
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Each probe is denatured and hybridized to a Affymetrix U133A Chip and/or a 
ChondroChip(tm) as described herein. Identification of genes differentially expressed in blood 
samples from patients with MDS as compared to Schizophrenic patients as compared to normal 
individuals is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
5 (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing 
Division, 2002) (data not shown). 294 genes were identified as being differentially expressed 
with a p value of < 0.05 as between the schizophrenic patients, the MDS patients and those 
control individuals. The identity of the differentially expressed genes is shown in Table 3 AC. 

Classification or class prediction of a test sample of an individual to determine whether 
10 said individuals has MDS, has Schizophrenia or is normal can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring (tm) ) for Class Predication are also available. 

One skilled in the art will appreciate readily that the present invention is well adapted to 
carry out the objects and obtain the ends and advantages mentioned, as well as those objects, 
ends and advantages inherent herein. The present examples, along with the methods, procedures, 
treatments, molecules, and specific compounds described herein are presently representative of 
preferred embodiments, are exemplary, and are not intended as limitations on the scope of the 
invention. Changes therein and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention as defined by the scope of the claims. 

All patents, patent applications, and published references cited herein are hereby 
incorporated by reference in their entirety. While this invention has been particularly shown and 
described with references to preferred embodiments thereof, it will be understood by those 
25 skilled in the art that various changes in form and details may be made therein without departing 
from the scope of the invention encompassed by the appended claims. 



123 



Attorney Docket No.: 4231/2055C 



EXAMPLE 52 

Analysis of Gene Expression Profiles of Blood Samples of Individuals So as to Predict 
Progression of Osteoarthritis. 

This example demonstrates the use of the claimed invention to predict the progression of 
Osteoarthritis. 

As used herein "osteoarthritis" is a form of degenerative joint disease which involves the 
deterioration of and changes to the cartilage and bone. In response to inflammation in and about 
the joint, the body responds with bony remodeling around the joint structure. This process can be 
slow and gradual with minimal outward symptoms, or more rapidly progressive with significant 
pain and discomfort. Osteoarthritic changes can occur in response to infection and injury of the 
joint as well. 

Blood samples are taken from test individuals not having any symptoms of osteoarthritis 
and gene expression profiles are then analyzed and compared to profiles from individuals having 
mild osteoarthritis. 

Classification or class prediction of a test sample of said individual to determine 
whether said individual has mild osteoarthritis or does not have osteoarthritis can be done 
using the differentially expressed genes identified as described herein as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person skilled 
in the art and described herein. Commercially available programs such as those provided by 
Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Individuals identified with mild osteoarthritis as a result of this classification have a 
significantly greater chance of developing moderate, marked and/or severe osteoarthris than 
those individuals not diagnosed with mild osteoarthritis. 
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